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Preface 


The first three editions of H.L.Royden’s Real Analysis have contributed to the education of 
generations of mathematical analysis students. This fourth edition of Real Analysis preserves 
the goal and general structure of its venerable predecessors—to present the measure theory, 
integration theory, and functional analysis that a modern analyst needs to know. 

The book is divided the three parts: Part I treats Lebesgue measure and Lebesgue 
integration for functions of a single real variable; Part II treats abstract spaces—topological 
spaces, metric spaces, Banach spaces, and Hilbert spaces; Part III treats integration over 
general measure spaces, together with the enrichments possessed by the general theory in 
the presence of topological, algebraic, or dynamical structure. 

The material in Parts II and III does not formally depend on Part I. However, a careful 
treatment of Part I provides the student with the opportunity to encounter new concepts in a 
familiar setting, which provides a foundation and motivation for the more abstract concepts 
developed in the second and third parts. Moreover, the Banach spaces created in Part I, the 
L? spaces, are one of the most important classes of Banach spaces. The principal reason for 
establishing the completeness of the L? spaces and the characterization of their dual spaces 
is to be able to apply the standard tools of functional analysis in the study of functionals and 
operators on these spaces. The creation of these tools is the goal of Part II. 


NEW TO THE EDITION 


e This edition contains 50% more exercises than the previous edition 


e Fundamental results, including Egoroff’s Theorem and Urysohn’s Lemma are now 
proven in the text. 


e The Borel-Cantelli Lemma, Chebychev’s Inequality, rapidly Cauchy sequences, and 
the continuity properties possessed both by measure and the integral are now formally 
presented in the text along with several other concepts. 


There are several changes to each part of the book that are also noteworthy: 


Part | 


e The concept of uniform integrability and the Vitali Convergence Theorem are now 
presented and make the centerpiece of the proof of the fundamental theorem of 
integral calculus for the Lebesgue integral 

e A precise analysis of the properties of rapidly Cauchy sequences in the L?( E) spaces, 
1 < p < ©, is now the basis of the proof of the completeness of these spaces 

e Weak sequential compactness in the L?(E) spaces, 1 < p < ov, is now examined in 
detail and used to prove the existence of minimizers for continuous convex functionals. 
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Part Il 


e General structural properties of metric and topological spaces are now separated into 
two brief chapters in which the principal theorems are proven. 

e In the treatment of Banach spaces, beyond the basic results on bounded linear 
operators, compactness for weak topologies induced by the duality between a Banach 
space and its dual is now examined in detail. 


e There is a new chapter on operators in Hilbert spaces, in which weak sequential com- 
pactness is the basis of the proofs of the Hilbert-Schmidt theorem on the eigenvectors 
of a compact symmetric operator and the characterization by Riesz and Schuader of 
linear Fredholm operators of index zero acting in a Hilbert space. 


Part Ill 


e General measure theory and general integration theory are developed, including the 
completeness, and the representation of the dual spaces, of the L?(X, 1) spaces for, 
1 < p < oo. Weak sequential compactness is explored in these spaces, including the 
proof of the Dunford-Pettis theorem that characterizes weak sequential compactness 
in L'(X, w). 

e The relationship between topology and measure is examined in order to characterize 
the dual of C(X), for a compact Hausdorff space X. This leads, via compactness 
arguments, to (i) a proof of von Neumann’s theorem on the existence of unique 
invariant measures on a compact group and (ii) a proof of the existence, for a mapping 
on a compact Hausdorf space, of a probability measure with respect to which the 
mapping is ergodic. 


The general theory of measure and integration was born in the early twentieth century. It 
is now an indispensable ingredient in remarkably diverse areas of mathematics, including 
probability theory, partial differential equations, functional analysis, harmonic analysis, and 
dynamical systems. Indeed, it has become a unifying concept. Many different topics can 
agreeably accompany a treatment of this theory. The companionship between integration 
and functional analysis and, in particular, between integration and weak convergence, has 
been fostered here: this is important, for instance, in the analysis of nonlinear partial 
differential equations (see L.C. Evans’ book Weak Convergence Methods for Nonlinear 
Partial Differential Equations [AMS, 1998}). 

The bibliography lists a number of books that are not specifically referenced but should 
be consulted for supplementary material and different viewpoints. In particular, two books 
on the interesting history of mathematical analysis are listed. 


SUGGESTIONS FOR COURSES: FIRST SEMESTER 


In Chapter 1, all the background elementary analysis and topology of the real line needed 
for Part 1 is established. This initial chapter is meant to be a handy reference. Core material 
comprises Chapters 2, 3, and 4, the first five sections of Chapter 6, Chapter 7, and the first 
section of Chapter 8. Following this, selections can be made: Sections 8.2-8.4 are interesting 
for students who will continue to study duality and compactness for normed linear spaces, 
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while Section 5.3 contains two jewels of classical analysis, the characterization of Lebesgue 
integrability and of Riemann integrability for bounded functions. 


SUGGESTIONS FOR COURSES: SECOND SEMESTER 


This course should be based on Part III. Initial core material comprises Section 17.1, Section 
18.1-18.4, and Sections 19.1-19.3. The remaining sections in Chapter 17 may be covered at 
the beginning or as they are needed later: Sections 17.3-17.5 before Chapter 20, and Section 
17.2 before Chapter 21. Chapter 20 can then be covered. None of this material depends on 
Part II. Then several selected topics can be chosen, dipping into Part II as needed. 


e Suggestion 1: Prove the Baire Category Theorem and its corollary regarding the partial 
continuity of the pointwise limit of a sequence of continuous functions (Theorem 7 of 
Chapter 10), infer from the Riesz-Fischer Theorem that the Nikodym metric space is 
complete (Theorem 23 of Chapter 18), prove the Vitali-Hahn-Saks Theorem and then 
prove the Dunford-Pettis Theorem. 


Suggestion 2: Cover Chapter 21 (omitting Section 20.5) on Measure and Topology, 
with the option of assuming the topological spaces are metrizable, so 20.1 can be 
skipped. 

Suggestion 3: Prove Riesz’s Theorem regarding the closed unit ball of an infinite 
dimensional normed linear space being noncompact with respect to the topology 
induced by the norm. Use this as a motivation for regaining sequential compactness 
with respect to weaker topologies, then use Helley’s Theorem to obtain weak sequential 
compactness properties of the L?(X, w) spaces, 1 < p < oo, if L7(X, w) is separable 
and, if Chapter 21 has already been covered, weak-* sequential compactness results 
for Radon measures on the Borel o-algebra of a compact metric space. 


SUGGESTIONS FOR COURSES: THIRD SEMESTER 


I have used Part II, with some supplemental material, for a course on functional analysis, 
for students who had taken the first two semesters; the material is tailored, of course, to that 
chosen for the second semester. Chapter 16 on bounded linear operators on a Hilbert space 
may be covered right after Chapter 13 on bounded linear operators on a Banach space, since 
the results regarding weak sequential compactness are obtained directly from the existence 
of an orthogonal complement for each closed subspace of a Hilbert space. Part II should be 
interlaced with selections from Part III to provide applications of the abstract space theory 
to integration. For instance, reflexivity and weak compactness can be considered in general 
L?(X,w) spaces, using material from Chapter 19. The above suggestion 1 for the second 
semester course can be taken in the third semester rather than the second, providing a truly 
striking application of the Baire Category Theorem. The establishment, in Chapter 21, of the 
representation of the dual of C(X), where X is a compact Hausdorff space, provides another 
collection of spaces, spaces of signed Radon measures, to which the theorems of Helley, 
Alaoglu, and Krein-Milman apply. By covering Chapter 22 on Invariant Measures, the 
student will encounter applications of Alaoglu’s Theorem and the Krein-Milman Theorem 
to prove the existence of Haar measure on a compact group and the existence of measures 
with respect to which a mapping is ergodic (Theorem 14 of Chapter 22), and an application 
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of Helley’s Theorem to establish the existence of invariant measures (the Bogoliubov-Krilov 
Theorem). 

I welcome comments at pmf@math.umd.edu. A list of errata and remarks will be 
placed on www.math.umd.edu/~pmf/RealAnalysis. 
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PART ONE 


LEBESGUE 
INTEGRATION FOR 
FUNCTIONS OFA 
SINGLE REAL 
VARIABLE 


Preliminaries on Sets, 
Mappings, and Relations 


Contents 
Unions and Intersections of Sets... 1... ee ee 3 
Equivalence Relations, the Axiom of Choice, and Zorn’s Lemma....... 5 


In these preliminaries we describe some notions regarding sets, mappings, and relations 
that will be used throughout the book. Our purpose is descriptive and the arguments given 
are directed toward plausibility and understanding rather than rigorous proof based on an 
axiomatic basis for set theory. There is a system of axioms called the Zermelo-Frankel 
Axioms for Sets upon which it is possible to formally establish properties of sets and thereby 
properties of relations and functions. The interested reader may consult the introduction 
and appendix to John Kelley’s book, General Topology [Kel75], Paul Halmos’s book, Naive 
Set Theory [Hal98], and Thomas Jech’s book, Set Theory [Jec06]. 


UNIONS AND INTERSECTIONS OF SETS 


For aset A,! the membership of the element x in A is denoted by x € A and the nonmembership 
of x in A is denoted by x ¢ A. We often say a member of A belongs to A and call a member of 
A a point in A. Frequently sets are denoted by braces, so that {x | statement about x} is the 
set of all elements x for which the statement about x is true. | 

Two sets are the same provided they have the same members. Let A and B be sets. We 
call A a subset of B provided each member of A is a member of B; we denote this by A C B 
and also say that A is contained in B or B contains A. A subset A of B is called a proper 
subset of B provided A # B. The union of A and B, denoted by A U B, is the set of all points 
that belong either to A or to B; that is, AU B= {x|x eA or x¢€ B}. The word or is used here 
in the nonexclusive sense, so that points which belong to both A and B belong to A U B. The 
intersection of A and B, denoted by AN B, is the set of all points that belong to both A and 
B; that is, AN B={x|x€A and x € B}. The complement of A in B, denoted by B~ A, is the 
set of all points in B that are not in A; that is, By A = {x|x eB, x ¢ A}. If, in a particular 
discussion, all of the sets are subsets of a reference set X, we often refer to X~A simply as 
the complement of A. 

The set that has no members is called the empty-set and denoted by 9. A set that is not 
equal to the empty-set is called nonempty. We refer to a set that has a single member as a 
singleton set. Given a set X, the set of all subsets of X is denoted by P(X) or 2*: it is called 
the power set of X. | 

In order to avoid the confusion that might arise when considering sets of sets, we 
often use the words “collection” and “family” as synonyms for the word “‘set.” Let F be - 
a collection of sets. We define the union of F, denoted by Ur. ¢ F, to be the set of points 


1The Oxford English Dictionary devotes several hundred pages to the definition of the word “set.” 
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that belong to at least one of the sets in F. We define the intersection of F, denoted by 
(res F, to be the set of points that belong to every set in ¥. The collection of sets F is said 
to be disjoint provided the intersection of any two sets in F is empty. For a family F of sets, 
the following identities are established by checking set inclusions. 


De Morgan's identities 


J r= (\[X~F] and X~ 


FeF FeFf 


X~ 


a | = J [X~F], 
FeF FeF 
that is, the complement of the union is the intersection of the complements, and the 
complement of the intersection is the union of the complements. 

For a set A, suppose that for each AeA, there is defined a set E,. Let F be the 
collection of sets {E, |A¢ A}. We write F = {Ej}, <, and refer to this as an indexing (or 
parametrization) of F by the index set (or parameter set) A. 


Mappings between sets 


Given two sets A and B, by a mapping or function from A into B we mean a correspondence 
that assigns to each member of A a member of B. In the case B is the set of real numbers 
we always use the word “function.” Frequently we denote such a mapping by f: AB, 
and for each member x of A, we denote by f(x) the member of B to which x is assigned. 
For a subset A’ of A, we define f(A’) = {b|b= f(a) for some member a of A’}: f(A’) is 
called the image of A’ under f. We call the set A the domain of the function f and f(A) 
the image or range of f. If f(A) = B, the function f is said to be onto. If for each member b 
of f(A) there is exactly one member a of A for which b= f(a), the function f is said to be 
one-to-one. A mapping f: A — B that is both one-to-one and onto is said to be invertible; 
we say that this mapping establishes a one-to-one correspondence between the sets A and B. 
Given an invertible mapping f: A — B, for each point b in B, there is exactly one member a 
of A for which f(a) =b and it is denoted by f—'(b). This assignment defines the mapping 
f-!: B—A, which is called the inverse of f. Two sets A and B are said to be equipotent 
provided there is an invertible mapping from A onto B. Two sets which are equipotent are, 
from the set-theoretic point of view, indistinguishable. 

Given two mappings f: A—> Band g: C—> D for which f(A) C C then the composition 
go f: A Dis defined by [g 0 f](x) = g( f(x)) for each x € A. It is not difficult to see that 
the composition of invertible mappings is invertible. For a set D, define the identity mapping 
idp: D> Dis defined by idp(x) = x for all xe D. A mapping f: A— B is invertible if and 
only if there is a mapping g: B— A for which 


go f =id, and f og = idg. 


Even if the mapping f: A— B is not invertible, for a set E, we define f~!(£) to be 
the set {a€ A| f(a) € E}; it is called the inverse image of E under f. We have the following 
useful properties: for any two sets FE, and E>, 


f-'(E, U Ey) = f-'(E}) U f1( Ea), f71( EN E2) = f7'( £1) 0 f"( Ea) 
and 
f-'(E.~ Ex) = f\(E1)~ fo'( £2). 
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Finally, for a mapping f: A—B and a subset A’ of its domain A, the restriction of f to A’, 
denoted by f|,4’, is the mapping from A’ to B which assigns f(x) to each x € A’. 


EQUIVALENCE RELATIONS, THE AXIOM OF CHOICE, AND ZORN’S LEMMA 


Given two nonempty sets A and B, the Cartesian product of A with B, denoted by A X B, is 
defined to be the collection of all ordered pairs (a, b) where a € A and be B and we consider 
(a, b) = (a’, b’) if and only if a =a’ and b = b’.” For a nonempty set X, we call a subset R 
of X XX arelation on X and write x R x’ provided (x, x’) belongs to R. The relation R is said 
to be reflexive provided x Rx, for all x € X; the relation R is said to be symmetric provided 
x Rx’ if x’ Rx; the relation R is said to be transitive provided whenever x Rx’ and x’ Rx", 
then x Rx”. 


Definition A relation R ona set X is called an equivalence relation provided it is reflexive, 
symmetric, and transitive. 


Given an equivalence relation R on a set X, for each x € X, the set R, = {x’|x’EX, x Rx’}is 
called the equivalence class of x (with respect to R). The collection of equivalence classes is 
denoted by X/R. For example, given a set X, the relation of equipotence is an equivalence 
relation on the collection 2* of all subsets of X. The equivalence class of a set with respect 
to the relation equipotence is called the cardinality of the set. 

Let R be an equivalence relation on a set X. Since R is symmetric and transitive, 
R, = Ry if and only if x Rx’ and therefore the collection of equivalence classes is disjoint. 
Since the relation R is reflexive, X is the union of the equivalence classes. Therefore X/R is 
a disjoint collection of nonempty subsets of X whose union is X. Conversely, given a disjoint 
collection F of nonempty subsets of X whose union is X, the relation of belonging to the 
same set in F is an equivalence relation R on X for which F = X/R. 

Given an equivalence relation on a set X, it is often necessary to choose a subset C 
of X which consists of exactly one member from each equivalence class. Is it obvious that 
there is such a set? Ernst Zermelo called attention to this question regarding the choice of 
elements from collections of sets. Suppose, for instance, we define two real numbers to be 
rationally equivalent provided their difference is a rational number. It is easy to check that 
this is an equivalence relation on the set of real numbers. But it is not easy to identify a set 
of real numbers that consists of exactly one member from each rational equivalence class. 


Definition Let F be a nonempty family of nonempty sets. A choice function f on F is a 
function f from F to Ure F with the property that for each set F in F, f(F) is a member 
of F. 


Zermelo’s Axiom of Choice Let F be a nonempty collection of nonempty sets. Then there is 
a choice function on F . 


2In a formal treatment of set theory based on the Zermelo-Frankel Axioms, an ordered pair (a, b) is defined to 
be the set {{a}, {a, b}} and a function with domain in A and image in B is defined to be a nonempty collection of 
ordered pairs in A X B with the property that if the ordered pairs (a, b) and (a, b’) belong to the function, then 
b=b'. 


6 Preliminaries on Sets, Mappings, and Relations 


Very roughly speaking, a choice function on a family of nonempty sets “chooses” a member 
from each set in the family. We have adopted an informal, descriptive approach to set theory 
and accordingly we will freely employ, without further ado, the Axiom of Choice. 


Definition A relation R on a set nonempty X is called a partial ordering provided it is 
reflexive, transitive, and, for x, x’ in X, 


ifx Rx’ and x’ Rx, thenx =x’. 


A subset E of X is said to be totally ordered provided for x, x’ in E, either x Rx’ or x' Rx. A 
member x of X is said to be an upper bound for a subset E of X provided x’ Rx for all x’ € E, 
and said to be maximal provided the only member x' of X for which x Rx’ is x’ = x. 


For a family F of sets and A, Be F, define ARB provided A C B. This relation of 
set inclusion is a partial ordering of F. Observe that a set F in F is an upper bound for a 
subfamily ¥’ of F provided every set in F’ is a subset of F and a set F in F is maximal 
provided it is not a proper subset of any set in F. Similarly, given a family F of sets and 
A, Be F define A R B provided B C A. This relation of set containment is a partial ordering 
of F. Observe that a set F in F is an upper bound for a subfamily ¥’ of F provided every 
set in F’ contains F and a set F in F is maximal provided it does not properly contain any 
setin F. 


Zorn’s Lemma Let X be a partially ordered set for which every totally ordered subset has an 
upper bound. Then X has a maximal member. 


We will use Zorn’s Lemma to prove some of our most important results, including the 
Hahn-Banach Theorem, the Tychonoff Product Theorem, and the Krein-Milman Theorem. 
Zorn’s Lemma is equivalent to Zermelo’s Axiom of Choice. For a proof of this equivalence 
and related equivalences, see Kelley [Kel75], pp. 31-36. 

We have defined the Cartesian product of two sets. It is useful to define the Carte- 
sian product of a general parametrized collection of sets. For a collecton of sets {E)} AEA 
parametrized by the set A, the Cartesian product of {E,}, <4, which we denote by IT, <4 E), is 
defined to be the set of functions f from A tol, <q E, such that for each Ae A, f(A) belongs 
to Ej. It is clear that the Axiom of Choice is equivalent to the assertion that the Cartesian 
product of a nonempty family of nonempty sets is nonempty. Note that the Cartesian product 
is defined for a parametrized family of sets and that two different parametrizations of the same 
family will have different Cartesian products. This general definition of Cartesian product is 
consistent with the definition given for two sets. Indeed, consider two nonempty sets A and B. 
Define A = {A;, Az} where A; #A2 and then define E), = Aand E), = B. The mapping that as- 
signs to the function f € IT, ¢, E, the ordered pair ( f(A1), f(Az)) is aninvertible mapping of 
the Cartesian product IT, <4 £, onto the collection of ordered pairs A X B and therefore these 
two sets are equipotent. For two sets E and A, define FE, = E forall A € A. Then the Cartesian 
product IT, <4 E, is equal to the set of all mappings from A to E.and is denoted by E“. 
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We assume the reader has a familiarity with the properties of real numbers, sets of real 
numbers, sequences of real numbers, and real-valued functions of a real variable, which are 
usually treated in an undergraduate course in analysis. This familiarity will enable the reader 
to assimilate the present chapter, which is devoted to rapidly but thoroughly establishing 
those results which will be needed and referred to later. We assume that the set of real 
numbers, which is denoted by R, satisfies three types of axioms. We state these axioms and 
derive from them properties on the natural numbers, rational numbers, and countable sets. 
With this as background, we establish properties of open and closed sets of real numbers; 
convergent, monotone, and Cauchy sequences of real numbers; and continuous real-valued 
functions of a real variable. 


1.1 THE FIELD, POSITIVITY, AND COMPLETENESS AXIOMS 
We assume as given the set R of real numbers such that for each pair of real numbers a and 
b, there are defined real numbers a + b and ab called the sum and product, respectively, of 
a and b for which the following Field Axioms, Positivity Axioms, and Completeness Axiom 
are Satisfied. 
The field axioms 
Commutativity of Addition: For all real numbers a and b, 
a+b=b-+a. 

Associativity of Addition: For all real numbers a, b, and c, 

(a+b)+c=a+(b+c). 
The Additive Identity: There is a real number, denoted by 0, such that 


0O+a=a+0=a for all real numbers a. 
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The Additive Inverse: For each real number a, there is a real number b such that 
a+b=0. 

Commutativity of Multiplication: For all real numbers a and b, 

ab = ba. 
Associativity of Multiplication: For all real numbers a, b, and c, 

(ab)c = a(bc). 
The Multiplicative Identity: There is a real number, denoted by 1, such that 
la=al=a for allreal numbers a. 

The Multiplicative Inverse: For each real number a #0, there is a real number b such that 

ab = 1. 
The Distributive Property: For all real numbers a, b, and c, 

a(b+c)=ab+ac. 


The Nontriviality Assumption: 
140. 


Any set that satisfies these axioms is called a field. It follows from the commutativity 
of addition that the additive identity, 0, is unique, and we infer from the commutativity 
of multiplication that the multiplicative unit, 1, also is unique. The additive inverse and 
multiplicative inverse also are unique. We denote the additive inverse of a by —a and, if a+0, 
its multiplicative inverse by a~! or 1/a. If we have a field, we can perform all the operations 
of elementary algebra, including the solution of simultaneous linear equations. We use the 
various consequences of these axioms without explicit mention.! 


The positivity axioms 


In the real numbers there is a natural notion of order: greater than, less than, and so on. 
A convenient way to codify these properties is by specifying axioms satisfied by the set of 
positive numbers. There is a set of real numbers, denoted by P, called the set of positive 
numbers. It has the following two properties: 


P1 Ifa and bare positive, then ab and a + b are also positive. 
P2 For a real number a, exactly one of the following three alternatives is true: 


ais positive, —a is positive, a=0. 


1A systematic development of the consequences of the Field Axioms may be found in the first chapter of the 
classic book A Survey of Modern Algebra by Garrett Birkhoff and Saunders MacLane [BM97]. 
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The Positivity Axioms lead in a natural way to an ordering of the real numbers: for real 
numbers a and b, we define a > b to mean that a — b is positive, and a > b to mean that a>b 
or a = b. We then define a < b to mean that b > a, and a < b to mean that b > a. 

Using the Field Axioms and the Positivity Axioms, it is possible to formally establish 
the familiar properties of inequalities (see Problem 2). Given real numbers a and b for which 
a <b, we define (a, b) = {x|a <x <5}, and say a point in (a, b) lies between a and b. We 
call a nonempty set J of real numbers an interval provided for any two points in J, all the 
points that lie between these points also belong to J. Of course, the set (a, b) is an interval, 
as are the following sets: 


[a, b] = {x | a<x<b};[a, b) ={x| a<x<b};(a, b] = {x | a<x<b}. (1) 


The completeness axiom 


A nonempty set E of real numbers is said to be bounded above provided there is a real 
number b such that x < bforallx € E : the number 5 is called an upper bound for E. 
Similarly, we define what it means for a set to be bounded below and for a number to be a 
lower bound for a set. A set that is bounded above need not have a largest member. But the 
next axiom asserts that it does have a smallest upper bound. 


The Completeness Axiom Let E be a nonempty set of real numbers that is bounded above. 
Then among the set of upper bounds for E there is a smallest, or least, upper bound. 


For a nonempty set E of real numbers that is bounded above, the least upper bound of 
E, the existence of which is asserted by the Completeness Axiom, will be denoted by 1.u.b. S. 
The least upper bound of E is usually called the supremum of E and denoted by sup S. It 
follows from the Completeness Axiom that every nonempty set E of real numbers that is 
bounded below has a greatest lower bound; it is denoted by g.l.b. E and usually called the 
infimum of F and denoted by inf E. A nonempty set of real numbers is said to be bounded 
provided it is both bounded below and bounded above. 


The triangle inequality 


We define the absolute value of a real number x, |x|, to be x if x > 0 and to be —x if x <0. 
The following inequality, called the Triangle Inequality, is fundamental in mathematical 
analysis: for any pair of real numbers a and b, 


la + b| < |a| + |DI. 


The extended real numbers 


It is convenient to introduce the symbols oo and —oo and write —oo < x < oo for all real 
numbers x. We call the set R U+oo the extended real numbers. If a nonempty set E of 
real numbers is not bounded above we define its supremum to be oo. It is also convenient 
to define —oo to be the supremum of the empty-set. Therefore every set of real numbers 
has a supremum that belongs to the extended real-numbers. Similarly, we can extend the 
concept of infimum so every set of real numbers has an infimum that belongs to the extended 
real numbers. We will define limits of sequences of real numbers and it is convenient to 
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allow limits that are extended real numbers. Many properties of sequences of real numbers 
that converge to real numbers, such as the limit of the sum is the sum of the limits and 
the limit of the product is the product of the limit, continue to hold for limits that are +00, 
provided we make the following extension of the meaning of sum and product: 00 + 00 = 00, 
—0o-—00 = —oo and, for each real number x, x+-00 = 00 and x—00 = — 00; ifx>0, x-00 = 00 
and x-(—00) = —oo while if x <0, x-00 = —oo and x-(—00) = 00. We define (—00, 00) = R. 
For a, b € R, we define 


(a,0o) = {xER | a<x}, (—00,b) = {xe R| x<bd} 
and 
[a,00) = {xeR | a<x}, (—o0, b] = {xe R| x <b}. 


Sets of the above form are unbounded intervals. We leave it as an exercise to infer from the 
completeness of R that all unbounded intervals are of the above type and that all bounded 
intervals are of the form listed in (1) together with intervals of the form (a, b). 


PROBLEMS 
1. For a#0 and b £0, show that (ab)~! = 71571. 


2. Verify the following: 
(i) For each real number a +0, a” > 0. In particular, 1 > 0 since 140 and 1 = 12. 


(ii) For each positive number a, its multiplicative inverse a~! also is positive. 
(ii) If a > b, then 
ac > bc if c > Qandac < be ifc <0. 
3. For a nonempty set of real numbers E, show that inf E = sup E if and only if E consists of a 
single point. 
4. Let a and b be real numbers. 
(i) Show that if ab = 0, thena = 0 orb = 0. 


(ii) Verify that a? — b” = (a—b)(a+5) and conclude from part (i) that if a? = b?, then 
a=bora=—b. 


(ii) Let c be a positive real number. Define E = {x €R| x? < c.} Verify that E is nonempty 
and bounded above. Define x9 = sup E. Show that x = c. Use part (ii) to show that 
there is a unique x > 0 for which x* = c. It is denoted by /c. 


5. Let a, b, and c be real numbers such that a #0 and consider the quadratic equation 
ax’ + bx+c=0, xeER. 


(i) Suppose b* — 4ac > 0. Use the Field Axioms and the preceding problem to complete the 
square and thereby show that this equation has exactly two solutions given by 


—b+/b — dac —_b — J — dac 
t= and = 


(ii) Now suppose b? — 4ac < 0. Show that the quadratic equation fails to have any solution. 
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6. Use the Completeness Axiom to show that every nonempty set of real numbers that is 
bounded below has an infimum and that 


inf E = —sup {-x| x€E}. 


7. For real numbers a and J, verify the following: 
(i) Jab] = |allbl. 
(ii) ja +b] < ja] + |d]. 
(ui) For e > 0, 
|x —a| <eifandonlyifa—e<x<a+te. 


1.2 THE NATURAL AND RATIONAL NUMBERS 


It is tempting to define the natural numbers to be the numbers 1, 2, 3, ... and so on. However, 
it is necessary to be more precise. A convenient way to do this is to first introduce the concept 
of an inductive set. 


Definition A set E of real numbers is said to be inductive provided it contains 1 and if the 
number x belongs to E, the number x + 1 also belongs to E. 


The whole set of real numbers R is inductive. From the inequality 1 > 0 we infer that 
the sets {x € R | x > 0} and {x ER | x > 1} are inductive. The set of natural numbers, denoted 
by N, is defined to be the intersection of all inductive subsets of R. The set N is inductive. 
To see this, observe that the number 1 belongs to N since 1 belongs to every inductive set. 
Furthermore, if the number k belongs to N, then k belongs to every inductive set. Thus, k + 1 
belongs to every inductive set and therefore k + 1 belongs to N. 


Principle of Mathematical Induction For each natural number n, let S(n) be some mathe- 
matical assertion. Suppose S(1) is true. Also suppose that whenever k is a natural number for 
which S(k) is true, then S(k +1) is also true. Then S(n) is true for every natural number n. 


Proof Define A = {kEN | S(k) is true}. The assumptions mean precisely that A is an 
inductive set. Thus N C A. Therefore S(n) is true for every natural number n. U 


Theorem 1 Every nonempty set of natural numbers has a smallest member. 


Proof Let E be a nonempty set of natural numbers. Since the set {x € R | x > 1} is inductive, 
the natural numbers are bounded below by 1. Therefore E is bounded below by 1. As a 
consequence of the Completeness Axiom, E has an infimum; define c = inf E. Since c + 1is 
not a lower bound for E, there is an m € E for which m<c+1. We claim that m is the smallest 
member of E. Otherwise, there is an n € E for which n < m. Since n€ E, c <n. Thusc < 
n<m<c+1and therefore m—n <1. Therefore the natural number m belongs to the interval 
(n, n+1). An induction argument shows that for every natural number n, (n, n+1)NN=90 
(see Problem 8). This contradiction confirms that m is the smallest member of E. C 


Archimedean Property For each pair of positive real numbers a and b, there is a natural 
number n for which na > b. 
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Proof Define c = b/a > 0. We argue by contradiction. If the theorem is false, then c is an 
upper bound for the natural numbers. By the Completeness Axiom, the natural numbers 
have a supremum; define cp = supN. Then co — 1 is not an upper bound for the natural 
numbers. Choose a natural number n such that n > co — 1. Therefore n + 1 > co. But the 
natural numbers are inductive so that n + 1 is a natural number. Since n + 1 > co, co is not 
an upper bound for the natural numbers. This contradiction completes the proof. U 


We frequently use the Archimedean Property of R reformulated as follows; for each 
positive real number e¢, there is a natural number n for which 1 j/n< «2 

We define the set of integers, denoted by Z, to be the set of numbers consisting of the 
natural numbers, their negatives, and the number 0. The set of rational numbers, denoted by 
Q, is defined to be the set of quotients of integers, that is, numbers x of the form x = m/n, 
where m and n are integers and n#0. A real number is called irrational if it is not rational. As 
we argued in Problem 4, there is a unique positive number x for which x* = 2; it is denoted 
by /2. This number is not rational. Indeed, suppose p and q are natural numbers for which 
(p/q)* = 2. Then p* = 2q’. The prime factorization theorem? tells us that 2 divides p? 
just twice as often as it divides p. Hence 2 divides p* an even number of times. Similarly, 2 
divides 2g” an odd number of times. Thus p” # 2? and therefore J? is irrational. 


Definition A set E of real numbers is said to be dense in R provided between any two real 
numbers there lies a member of E. 


Theorem 2 The rational numbers are dense in R. 


Proof Let a and b be real numbers with a < b. First suppose that a > 0. By the Archimedean 
Property of R, there is a natural number g for which (1/q) < b— a. Again using the 
Archimedean Property of R, the set of natural numbers S = {n € N|n/gq > b} is nonempty. 
According to Theorem 1, S has a smallest member p. Observe that 1/g <b—a<b and hence 
p> 1. Therefore p — 1 is a natural number (see Problem 9) and so, by the minimality of the 
choice of p, (p —1)/q <b. We also have 


a=b—(b—a) <(p/q) —(1/q) =(p—1)/@. 


Therefore the rational number r = (p — 1)/q lies between a and Db. If a <0, by the 
Archimedean property of R, there is a natural number n for which n > —a. We infer from 
the first case considered that there is a rational number r that lies between n + a andn + b. 
Therefore the natural number r — n lies between a and b. L 


PROBLEMS 


8. Use an induction argument to show that for each natural number n, the interval (n, n + 1) 
fails to contain any natural number. 


2 Archimedeas exphcitly asserted that it was his fellow Greek, Eurathostenes, who identified the property that 
we have here attributed to Archimedeas. 

3This theorem asserts that each natural number may be uniquely expressed as the product of prime natural 
numbers; see [BM97]. 
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9. Use an induction argument to show that if n > 1 is a natural number, then n — 1 also is a 
natural number. Then use another induction argument to show that if m and n are natural 
numbers with n > m, then n — m is a natural number. 


10. Show that for any integer n, there is exactly one integer in the interval [n, n + 1). 
11. Show that any nonempty set of integers that is bounded above has a largest member. 
12. Show that the irrational numbers are dense in R. 


13. Show that each real number is the supremum of a set of rational numbers and also the 
supremum of a set of irrational numbers. 


14. Show that ifr > 0, then, for each natural number n, (1+ r)" >1+n-r. 
15. Use induction arguments to prove that for every natural number n, 


0 = (n+ 1)(2n +1) 
2 mn 
(ii) 
P42P4...4n3=(1424...4n). 
(iit) 


—-1 


1— 
l+rt+...¢r= ifr#1 


1.3  COUNTABLE AND UNCOUNTABLE SETS 


In the preliminaries we called two sets A and B equipotent provided there is a one-to-one 
mapping f of A onto B. We refer to such an f as a one-to-one correspondence between 
the sets A and B. Equipotence defines an equivalence relation among sets, that is, it is 
reflexive, symmetric, and transitive (see Problem 20). It is convenient to denote the initial 
segment of natural numbers {k € N|1<k <n} by {1,...,}. The first observation regarding 
equipotence is that for any natural numbers n and m, the set {1, ..., n +m} is not equipotent 
to the set {1,...,}. This observation is often called the pigeonhole principle and may be 
proved by an induction argument with respect to n (see Problem 21). 


Definition A set E is said to be finite provided either it is empty or there is a natural number 
n for which E is equipotent to {1,...,n}. We say that E is countably infinite provided E is 
equipotent to the set N of natural numbers. A Set that is either finite or countably infinite is said 
to be countable. A set that is not countable is called uncountable. 


Observe that if a set is equipotent to a countable set, then it is countable. In the proof 
of the following theorem we will use the pigeonhole principle and Theorem 1, which tells us 
that every nonempty set of natural numbers has a smallest, or first, member. 


Theorem 3 A subset of a countable set is countable. In particular, every set of natural numbers 
is countable. 


Proof Let B be a countable set and A a nonempty subset of B. First consider the case that Bis 
finite. Let f be a one-to-one correspondence between {1,...,n} and B. Define g(1) to be the 
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first natural number j,1 < j <n, for which f(j) belongs to A. If A = {f(g(1))} the proof is 
complete since f o g is a one-to-one correspondence between {1} and A. Otherwise, define 
g(2) to be the first natural number j, 1 < j <n, for which f(j) belongs to A~{f(g9(1))}. 
The pigeonhole principle tells us that this inductive selection process terminates after at 
most N selections, where N <n. Therefore f o g is a one-to-one correspondence between 
{1,..., N} and A. Thus A is finite. 


Now consider the case that Bis countably infinite. Let f be a one-to-one correspondence 
between N and B. Define g(1) to be the first natural number j for which f(j) belongs to A. 
Arguing as in the first case, we see that if this selection process terminates, then A is finite. 
Otherwise, this selection process does not terminate and g is properly defined on all of N. 
It is clear that f o g is a one-to-one mapping with domain N and image contained in A. An 
induction argument shows that g(j) > j for all j. For each x € A, there is some k for which 
x = f(k). Hence x belongs to the set {f(g(1)),..., f(g(k))}. Thus the image of f 0 gis A. 
Therefore A is countably infinite. U 


Corollary 4 The following sets are countably infinite: 


n times 
-____ 
(i) For each natural numbers n, the Cartesian product NX---XN. 


(ii) The set of rational numbers Q. 


Proof We prove (i) for n = 2 and leave the general case as an exercise in induction. Define 
the mapping g from N XN to N by g(m, n) = (m+n)? +n. The mapping g is one-to-one. 
Indeed, if g(m, n) = g(m’, n’), then (m+n)* —(m' +n’)? =n’ —n and hence 


Imnt+n+m' +n'|-|\m+n—m' —n'|=|n' —nl. 


Ifn #n’, then the natural number m +n + m’ +n’ both divides and is greater than the natural 
number |n’ —n|, which is impossible. Thus n = n’, and hence m = m’. Therefore N XN 
is equipotent to g(N XN), a subset of the countable set N. We infer from the preceding 
theorem that N XN is countable. To verify the countability of Q we first infer from the 
prime factorization theorem that each positive rational number x may be written uniquely 
as x = p/q where p and q are relatively prime natural numbers. Define the mapping g from 
Q to N by 9(0) = 0, e(p/q) = (p+ q)* + qif x = p/q > 0 and p and q are relatively prime 
natural numbers and g(x) = —g(— x) if x <0. We leave it as an exercise to show that g is 
one-to-one. Thus Q is equipotent to a subset of N and hence, by the preceding theorem, is 
countable. We leave it as an exercise to use the pigeonhole principle to show that neither 
N XN nor Q is finite. a 


For a countably infinite set X, we say that {x, |n € N} is an enumeration of X provided 
X= {xn | neN} and x, #Xm wn#m. 


Theorem 5 A nonempty set is countable if and only if it is the image of a function whose 
domain is a nonempty countable set. 
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Proof Let A be a nonempty countable set and f be mapping of A onto B. We suppose 
that A is countably infinite and leave the finite case as an exercise. By composing with a 
one-to-one correspondence between A and N, we may suppose that A = N. Define two 
points x, x’ in A to be equivalent provided f(x) = f(x’). This is an equivalence relation, that 
is, it is reflexive, symmetric, and transitive. Let E be a subset of A consisting of one member 
of each equivalence class. Then the restriction of f to E is a one-to-one correspondence 
between E and B. But EF is a subset of N and therefore, by Theorem 3, is countable. The set 
B is equipotent to E and therefore B is countable. The converse assertion is clear; if B is a 
nonempty countable set, then it is equipotent either to an initial segment of natural numbers 
or to the natural numbers. L 


Corollary 6 The union of a countable collection of countable sets is countable. 


Proof Let A be a countable set and for each A€ A, let E, be a countable set. We will show 
that the union EF = Uyc¢, E, is countable. If E is empty, then it is countable. So we assume 
E#@. We consider the case that A is countably infinite and leave the finite case as an exercise. 
Let {A, |n € N} be an enumeration of A. Fixn € N. If E), is finite and nonempty, choose 
a natural number N(n) and a one-to-one mapping f, of {1,..., N(n)} onto E),; if E), is 
countably infinite, choose a a one-to-one mapping f, of N onto E),. Define 


E' = {(n, k) €NXNIE), is nonempty, and 1 < k < N(n) if Ej, is also finite}. 


Define the mapping f of E’ to E by f(n, k) = f,(k). Then f is a mapping of E’ onto E. 
However, E’ is a subset of the countable set N X N and hence, by Theorem 3, is countable. 
Theorem 5 tells us that E also is countable. L] 


We call an interval of real numbers degenerate if it is empty or contains a single 
member. 


Theorem 7 A nondegenerate interval of real numbers is uncountable. 


Proof Let J be a nondegenerate interval of real numbers. Clearly J is not finite. We argue by 
contradiction to show that J is uncountable. Suppose / is countably infinite. Let {x, |n € N} 
be an enumeration of J. Let [a,, b,| be a nondegenerate closed, bounded subinterval of I 
which fails to contain x;. Then let [a2, by] be a nondegenerate closed, bounded subinterval of 
[a,, b;], which fails to contain x2. We inductively choose a countable collection {[a,, bn]}°°, 
of nondegenerate closed, bounded intervals, which is descending in the sense that, for 
each n, [ans1, Dp41] C [an, by] and such that for each n, x, E[an, b,]. The nonempty set 
E = {a,|n €N} 1s bounded above by b;. The Completeness Axiom tells us that E has a 
supremum. Define x* = sup E. Since x* is an upper bound for E, a, < x* for all n. On the 
other hand, since {[a,, b,]}°°, is descending, for each n, b, is an upper bound for E. Hence, 
for each n, x* < b,. Therefore x* belongs to [a,, b,] for each n. But x* belongs to [a;, by] C J 
and therefore there is a natural number no for which x* = x,,. We have a contradiction since 
x* = X,, does not belong to [a,, bn]. Therefore J is uncountable. OO 
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PROBLEMS 
16. Show that the set Z of integers is countable. 
17. Show that a set A is countable if and only if there is a one-to-one mapping of A to N. 
18. Use an induction argument to complete the proof of part (i) of Corollary 4. 
19. Prove Corollary 6 in the case of a finite family of countable sets. 


20. Let both f: A B and g: BC be one-to-one and onto. Show that the composition 
go f: A Band the inverse f~!: B— A are also one-to-one and onto. 


21. Use an induction argument to establish the pigeonhole principle. 
22. Show that 2) the collection of all sets of natural numbers, is uncountable. 


23. Show that the Cartesian product of a finite collection of countable sets is countable. Use 
the preceding problem to show that NN, the collection of all mappings of N into N, is not 
countable. 

24. Show that a nondegenerate interval of real numbers fails to be finite. 

25. Show that any two nondegenerate intervals of real numbers are equipotent. 


26. Is the set R X R equipotent to R? 


1.4 OPEN SETS, CLOSED SETS, AND BOREL SETS OF REAL NUMBERS 


Definition A set O of real numbers is called open provided for each x €O, there is ar >0 for 
which the interval (x —r, x +r) is contained in O. 


For a <b, the interval (a, b) is an open set. Indeed, let x belong to (a, b). Define 
r = min{b — x, x — a}. Observe that (x —r, x +r) is contained in (a, b). Thus (a, b) is an 
open bounded interval and each bounded open interval is of this form. For a,b € R, we 
defined 


(a,00) = {xER| a<x},(-00,b)= {xe R| x <b} and (—co,00) =R. 


Observe that each of these sets is an open interval. Moreover, it is not difficult to see that 
since each set of real numbers has an infimum and supremum in the set of extended real 
numbers, each unbounded open interval is of the above form. 


Proposition 8 The set of real numbers R and the empty-set ® are open; the intersection of any 
finite collection of open sets is open; and the union of any collection of open sets is open. 


Proof It is clear that R and % are open and the union of any collection of open sets is 
open. Let {Ox};_, be a finite collection of open subsets of R. If the intersection of this 
collection is empty, then the intersection is the empty-set and therefore is open. Otherwise, 
let x belong to M_, Ox. For 1 < k <n, choose rz > 0 for which (x — ry, x +r~) C Ox. Define 
r=min{r;,...,%m}.Thenr >0 and (x-—r, xt+r)C M1 Ox. Therefore M_,Ox. is open. U1 


It is not true, however, that the intersection of any collection of open sets is open. For 
example, for each natural number n, let O, be the open interval (—1/n, 1/n). Then, by the 
Archimedean Property of R, °°, O, = {0}, and {0} is not an open set. 
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Proposition 9 Every nonempty open set is the disjoint union of a countable collection of open 
intervals. 


Proof Let O be a nonempty open subset of R. Let x belong to O. There is a y > x for which 
(x, y) CO and az < x for which (z, x) C O. Define the extended real numbrs a, and b, by 


a, = inf {z| (z, x) CO} and b, = sup {y| (x, y) CO}. 
Then I, = (a,, b,) is an open interval that contains x. We claim that 
I, CO but a, ¢O, b, 0. (2) 


Indeed, let w belong to I,, say x < w < b,. By the definition of b,, there is a number 
y > w such that (x, y) C O, and so weO. Moreover, b, €O, for if b; €O, then for some 
r >0 we have (b, —r, b, +r) C O. Thus (x, b, +r) C O, contradicting the definition 
of b,. Similarly, a, ¢ O. Consider the collection of open intervals {7,},<-o. Since each x in 
O is a member of /,, and each [, is contained in O, we have O = U,<o1,. We infer 
from (2) that {1,},-o 1s disjomt. Thus © is the union of a disjoint collection of open 
intervals. It remains to show that this collection is countable. By the density of the rationals, 
Theorem 2, each of these open intervals contains a rational number. This establishes a 
one-to-one correspondence between the collection of open intervals and a subset of the 
rational numbers. We infer from Theorem 3 and Corollary 4 that any set of rational 
numbers is countable. Therefore O is the union of a countable disjoint collection of open 
intervals. C] 


Definition For a set E of real numbers, a real number x is called a point of closure of E 
provided every open interval that contains x also contains a point in E. The collection of points 
of closure of E is called the closure of E and denoted by E. 


It is clear that we always have E C E. If E contains all of its points of closure, that is, 
E = E, then the set E is said to be closed. 


Proposition 10 For a set of real numbers E, its closure E is closed. Moreover, E is the smallest 
closed set that contains E in the sense that if F is closed and E C F, then EC F. 


Proof The set E is closed provided it contains all its points of closure. Let x be a point of 
closure of E. Consider an open interval I, which contains x. There is a point x’ € EN 1,. Since 
x’ is a point of closure of E and the open interval [,, contains x’, there is a point x” € EN I,. 
Therefore every open interval that x also contains a point of E and hence x € E. So the set E 
is closed. It is clear that if A C B, then A C B, and hence if F is closed and contains E, then 
ECF=F. O 


Proposition 11 A set of real numbers is open if and only if its complement in R is closed. 


Proof First suppose E is an open subset of R. Let x be a point of closure of R~ E. Then x 
cannot belong to E because otherwise there would be an open interval that contains x and 
is contained in E and thus is disjoint from R ~ E. Therefore x belongs to R~ E and hence 
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R~ E is closed. Now suppose R~ E is closed. Let x belong to E. Then there must be an 
open interval that contains x that is contained in E, for otherwise every open interval that 
contains x contains points in X ~ E and therefore x is a point of closure of R~ E. Since 
R ~ E is closed, x also belongs to R ~ E. This is a contradiction. L 


Since R ~~ [R~ E] = E, it follows from the preceding proposition that a set is closed if 
and only if its complement is open. Therefore, by De Morgan’s Identities, Proposition 8 may 
be reformulated in terms of closed sets as follows. 


Proposition 12 The empty-set 0 and R are closed; the union of any finite collection of closed 
sets is closed; and the intersection of any collection of closed sets is closed. 


A collection of sets {E,},cq is said to be a cover of a set E provided E C Ujeq Ey. 
By a subcover of a cover of E we mean a subcollection of the cover that itself also is a cover 
of E. If each set E, in a cover is open, we call {E,},<, an open cover of F. If the cover 
{E)},eq4 contains only a finite number of sets, we call it a finite cover. This terminology is 
inconsistent: In “open cover” the adjective “open”’ refers to the sets in the cover; in “finite 
cover’’ the adjective “finite” refers to the collection and does not imply that the sets in the 
collection are finite sets. Thus the term “open cover” is an abuse of language and should prop- 
erly be “cover by open sets.”” Unfortunately, the former terminology is well established in 
mathematics. 


The Heine-Borel Theorem Let F be a closed and bounded set of real numbers. Then every 
open cover of F has a finite subcover. 


Proof Let us first consider the case that F is the closed, bounded interval [a, b]. Let F 
be an open cover of [a, b]. Define E to be the set of numbers x €[a, b] with the property 
that the interval [a, x] can be covered by a finite number of the sets of F. Since ae E, E is 
nonempty. Since E is bounded above by b, by the completeness of R, E has a supremum; 
define c = sup E. Since c belongs to [a, b], there is an O € F that contains c. Since is open 
there is an € > 0, such that the interval (c — €, c + €) is contained in O. Now c — ¢€ is not an 
upper bound for E, and so there must be an x € E with x > c — €. Since x € E, there is a finite 
collection {O;,...,O,} of sets in F that covers [a, x]. Consequently, the finite collection 
{O1,..., Ox, O} covers the interval [a, c+ €). Thus c = b, for otherwise c <b and c is not an 
upper bound for E. Thus [a, b] can be covered by a finite number of sets from F, proving 
our special case. 


Now let F be any closed and bounded set and F an open cover of F. Since F is 
bounded, it is contained in some closed, bounded interval [a, b]. The preceding proposition 
tells us that the set O = R~ F is open since F is closed. Let F* be the collection of open 
sets obtained by adding O to F, that is, ¥* = F UO. Since F covers F, F* covers [a, b]. By 
the case just considered, there is a finite subcollection of F* that covers [a, b] and hence F. 
By removing © from this finite subcover of F, if O belongs to the finite subcover, we have a 
finite collection of sets in F that covers F. L 


We say that a countable collection of sets {E,}°°, is descending or nested provided 
En+41 © E, for every natural number n. It is said to be ascending provided E,, C E,,4; for 
every natural number n. 
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The Nested Set Theorem Let (F,}°°, be a descending countable collection of nonempty 
closed sets of real numbers for which F, bounded. Then 


OO 
() Fn #9. 
n=1 


Proof We argue by contradiction. Suppose the intersection is empty. Then for each real 
number x, there is a natural number a for which x ¢ F,, that is, x €O, = R~ F,. Therefore 

°°, On = R. According to Proposition 4, since each F,, is closed, each O, is open. Therefore 
{O,,}°C, is an open cover of R and hence also of F. The Heine-Borel Theorem tells us 
that there a natural number N for which F C Un, O,. Since {F,}°C, is descending, the 
collection of complements {O,,}°° , is ascending. Therefore Un O, = On = R~ Fy. Hence 
F, CR~ Fy, which contradicts the assumption that Fy is a nonempty subset of F;. C 


Definition Given a set X, acollection A of subsets of X is called a o-algebra (of subsets of X) 
provided (i) the empty-set, 0, belongs to A; (ii) the complement in X of a set in A also belongs 
to A; (iii) the union of a countable collection of sets in A also belongs to A. 


Given a set X, the collection {@, X} is a o-algebra which has two members and is 
contained in every o-algebra of subsets of X. At the other extreme is the collection of sets 
2* which consists of all subsets of X and contains every o-algebra of subsets of X. For 
any o-algebra A, we infer from De Morgan’s Identities that A is closed with respect to 
the formation of intersections of countable collections of sets that belong to A; moreover, 
since the empty-set belongs to A, A is closed with respect to the formation of finite 
unions and finite intersections of sets that belong to A. We also observe that a o-algebra 
is closed with respect to relative complements since if A; and Az belong to A, so does 
Ay ~ Az = Ay N[X~ Ap]. The proof of the following proposition follows directly from the 
definition of o-algebra. 


Proposition 13. Let F be a collection of subsets of a set X. Then the intersection A of all 
o-algebras of subsets of X that contain F is a o-algebra that contains F. Moreover, it is the 
smallest o-algebra of subsets of X that contains F in the sense that any o-algebra that contains 
F also contains A. 


Let {A,}°., be a countable collection of sets that belong to a o-algebra A. Since A 
is closed with respect to the formation of countable intersections and unions, the following 
two sets belong to A: 


o.@) o.@) o.@) o.@) 
lim sup(A,}°, =() A, | and liminf{A,}°2, =U A, |. 
k=1 |n=k k=1 | n=k 


The set lim sup{A,,}°, is the set of points that belong to A, for countably infinitely many 
indices n while the set lim inf{A,,}°C, is the set of points that belong to A, except for at most 
finitely many indices n. 

Although the union of any collection of open sets is open and the intersection of 
any finite collection of open sets is open, as we have seen, the intersection of a countable 
collection of open sets need not be open. In our development of Lebesgue measure and 
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integration on the real line, we will see that the smallest o-algebra of sets of real numbers 
that contains the open sets is a natural object of study. 


Definition The collection B of Borel sets of real numbers is the smallest o-algebra of sets of 
real numbers that contains all of the open sets of real numbers. 


Every open set is a Borel set and since a g-algebra is closed with respect to the formation 
of complements, we infer from Proposition 4 that every closed set is a Borel set. Therefore, 
since each singleton set is closed, every countable set is a Borel set. A countable intersection 
of open sets is called a Gs set. A countable union of closed sets is called an F, set. Since 
a o-algebra is closed with respect to the formation of countable unions and countable 
intersections, each Gs set and each F, set is a Borel set. Moreover, both the liminf and 
lim sup of a countable collection of sets of real numbers, each of which is either open or 
closed, is a Borel set. 


PROBLEMS 
27. Is the set of rational numbers open or closed? 
28. What are the sets of real numbers that are both open and closed? 
29. Find two sets A and B such that AN B = Gand AN B#Q. 


30. A point x is called an accumulation point of a set E provided it is a point of closure of E ~ {x}. 
(i) Show that the set E’ of accumulation points of E is a closed set. 


(ii) Show that E = EU E’. 
31. A point x is called an isolated point of a set E provided there is an r > 0 for which 
(x—r, x+r) NE = {x}. Show that if a set E consists of isolated points, then it is countable. 


32. A point x is called an interior point of a set E if there is an r > 0 such that the open interval 
.(x—r, x +r) is contained in E. The set of interior points of E is called the interior of E 
denoted by int E. Show that 


(i) Eis open if and only if E = int E. 
(ii) E is dense if and only if int(R ~ E) = 9. 
33. Show that the Nested Set Theorem is false if F, is unbounded. 


34. Show that the assertion of the Heine-Borel Theorem is equivalent to the Completeness 
Axiom for the real numbers. Show that the assertion of the Nested Set Theorem is equivalent 
to the Completeness Axiom for the real numbers. 


35. Show that the collection of Borel sets is the smallest c-algebra that contains the closed sets. 


36. Show that the collection of Borel sets is the smallest c-algebra that contains intervals of the 
form [a, b), where a < b. 


37. Show that each open set is an F,, set. 


1.5 SEQUENCES OF REAL NUMBERS 


A sequence of real numbers is a real-valued function whose domain is the set of natu- 
ral numbers. Rather than denoting a sequence with standard functional notation such as 
f : NR, it is customary to use subscripts, replace f(n) with a,, and denote a sequence 
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by {a,}. A natural number n is called an index for the sequence, and the number a, cor- 
responding to the index n is called the nth term of the sequence. Just as we say that a 
real-valued function is bounded provided its image is a bounded set of real numbers, we 
Say a sequence {a,} is bounded provided there is some c > 0 such that |a,| < c for all n. 
A sequence is said to be increasing provided a, < a,,; for all n, is said to be decreasing 
provided {—a,} is inceasing, and said to be monotone provided it is either increasing or 
decreasing. 


Definition A sequence {a,} is said to converge to the number a provided for every € > 0, 
there is an index N for which 


ifn > N, then |a — a,| <€. (3) 
We call a the limit of the sequence and denote the convergence of {a,} by writing 


{an} aor lim a, =a. 
n> wo 


We leave the proof of the following proposition as an exercise. 


Proposition 14 Let the sequence of real numbers {a,} converge to the real number a. Then 
the limit is unique, the sequence is bounded, and, for a real number c, 


if an <c foralln, thena <c. 


Theorem 15 (the Monotone Convergence Criterion for Real Sequences) A monotone 
sequence of real numbers converges if and only if it is bounded. 


Proof Let {a,} be an increasing sequence. If this sequence converges, then, by the preceding 
proposition, it is bounded. Now assume that {a,} is bounded. By the Completeness Axiom, 
the set S = {a, |n € N} has a supremum: define a = sup S. We claim that {a,} — a. Indeed, 
let € > 0. Since s is an upper bound for S, a, < a for all n. Since a — € is not an upper bound 
for S, there is an index N for which ay > a — «. Since the sequence is increasing, a, >a — € 
for alln > N. Thus ifn > N, then |a — a,| < ¢. Therefore {a,,} + a. The proof for the case 
when the sequence is decreasing is the same. CI 


For a sequence {a,} and a strictly increasing sequence of natural numbers {n;}, we call 
the sequence {a,,} whose kth term is a,, a subsequence of {a;,}. 


Theorem 16 (the Bolzano-Weierstrass Theorem) Every bounded sequence of real numbers 
has a convergent subsequence. 


Proof Let {a,} be a bounded sequence of real numbers. Choose M > 0 such that |a,| < M 
for all n. Let n be a natural number. Define E,, = {a;| j > n}. Then E, C[—M, M] and E, is 
closed since it is the closure of a set. Therefore {E,,} is a descending sequence of nonempty 
closed bounded subsets of R. The Nested Set Theorem tells us that °°, E, #@; choose 
aé ()\-, En. For each natural number k, a is a point of closure of {a;| j = k}. Hence, for 
infinitely many indices j > n, a; belongs to (a—1/k, a+1/k). We may therefore inductively 
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choose a strictly increasing sequence of natural numbers {n;} such that Ja—a, .|<1/k for all k. 
By the Archimedean Property of R, the subsequence {a, ,} converges to a. L 


Definition A sequence of real numbers {an} is said to be Cauchy provided for each € > 0, 
there is an index N for which 


ifn,m > N, then |am — an| <e. (4) 


Theorem 17 (the Cauchy Convergence Criterion for Real Sequences) A sequence of real 
numbers converges if and only if it is Cauchy. 


Proof First suppose that {a,}—> a. Observe that for all natural numbers n and m, 
lan — Am| = |(@n — a) + (@—am)| < |an — al + lam — al. (5) 


Let € > 0. Since {a,}—> a, we may choose a natural number N such that if n > N, then 
|a, —a| <€/2. We infer from (5) that ifn, m > N, then |a, — a,,| <<. Therefore the sequence 
{a,} is Cauchy. To prove the converse, let {a,} be a Cauchy sequence. We claim that it is 
bounded. Indeed, for « = 1, choose N such that ifn, m > N, then lan — Am| < 1. Thus 


lan| = |(@n — an) + an| < lan — ay| + lay| < 1+ lay| for alln > N. 


Define M = 1 + max{|aj|,..., |an|}. Then |a,| < M for all n. Thus {a,} is bounded. The 
Bolzano- Weierstrass Theorem tells us there is a subsequence {a,, } which converges to a. We 
claim that the whole sequence converges to a. Indeed, let € > 0. Since {a,} is Cauchy we may 
choose a natural number N such that 


ifn,m > N, then ja, —a,| << €/2. 


On the other hand, since {a,,}-—» a we may choose a natural number n; such that la — an, | 
<¢/2 and n; > N. Therefore 


lan — al = |(Gn — Qn, ) + (Gn, —@)| < lap ~An,| + |a—ay,|<e¢ for alln>N. a 


Theorem 18 (Linearity and Montonicity of Convergence of Real Sequences) Let {a,} and 
{b,} be convergent sequences of real numbers. Then for each pair of real numbers a and B, 
the sequence {a - a, + B - b,} is convergent and 


lim [a-a, +B- bn] =a- lim an+B- lim by. (6) 
nao now n> co 
Moreover, 
if an <b, foralln, then lim a, < lim bp. (7) 
nao n—-> oo 


Proof Define 
lim a, =aand lim b, =b. 
n—- coo n— &} 


Observe that 


Ia - a, + B- bn] -[a-a+B-b)| < |a| - la, —a| + |B] - |b, — | for all n. (8) 
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Let e« > 0. Choose a natural number N such that 
lan — al < €/[2 + 2|a|] and |b, — b| < «/[2 + 2|B|] for alln > N. 
We infer from (8) that 
|[a- a, +B-b,]-—[a-a+B-b]| <¢foralln > N. 


Therefore (6) holds. To verify (7), set c, = b, — a, for all n and c = b—a, Thenc, > 0 for 
all n and, by linearity of convergence, {c,}—> c. We must show c > 0. Let « > 0. There is an 
N such that 

—€<c—c, <eforalln >N. 


In particular, 0 < cy <c +. Since c > —e for every positive number ¢, c > 0. L 


If a sequence {a,} has the property that for each real number c, there is an index N 
such that ifn > N, then a, > c we say that {a,,} converges to infinity, call 00 the limit of {a,} 
and write lim, _, 90 d, = oo. Similar definitions are made at —oo. With this extended concept 
of convergence we may assert that any monotone sequence {a,,} of real numbers, bounded 
or unbounded, converges to an extended real number and therefore lim, -, .9 dn is properly 
defined. 

The extended concept of supremum and infimum of a set and of convergence for any 
monotone sequence of real numbers allows us to make the following definition. 


Definition Let {a,} be a sequence of real numbers. The limit superior of {a,}, denoted by 
lim sup{a,}, is defined by 


lim sup{an} = lim [sup {a, | k >n}). 
The limit inferior of {a,}, denoted by liminf{a,}, is defined by 
liminf{a,} = lim [inf {a, | k >n}). 
n> oo 


We leave the proof of the following proposition as an exercise. 


Proposition 19 Let {a,} and {b,} be sequences of real numbers. 
(i) limsup{a,} = ¢ € R if and only if for each € > 0, there are infinitely many indices n 
for which a, > £ — € and only finitely many indices n for which a, > £ +€. 
(ti) limsup{a,} = 00 if and only if {a,} is not bounded above. 
(iii) 
lim sup{a,} = — lim inf{—a,}. 
(iv) A sequence of real numbers {a,} converges to an extended real number a if and only if 


lim inf{a,} = lim sup{a,} = a. 


(v) If an <b, forall n, then 
lim sup{a,} < lim inf{b,}. 
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For each sequence {a;} of real numbers, there corresponds a sequence of partial sums 
{sn} defined by s, = >7;_, a for each index n. We say that the series Dp 4k 1s summable to 
the real number s provided {s,}—> s and write s = S/7° , ag. 

We leave the proof of the following proposition as an exercise. 


Proposition 20 Let {a,,} be a sequence of real numbers. 


(it) The series 3°, ax is summable if and only if for each € > 0, there is an index N for 
which 


n+m 


> 4 


k=n 


<€forn > N and any natural number m. 


(ii) If the series 37°, \ax| is summable, then 7° , a; also is summable. 
(iti) If each term a; is nonnegative, then the series 57°, ax is summable if and only if the 
sequence of partial sums is bounded. 


PROBLEMS 


38. We call an extended real number a cluster point of a sequence {a,,} if a subsequence converges 
to this extended real number. Show that lim inf{a,} is the smallest cluster point of {a,,} and 
lim sup{a,} is the largest cluster point of {a,}. 


39. Prove Proposition 19. 


40. Show that a sequence {a,} is convergent to an extended real number if and only if there is 
exactly one extended real number that is a cluster point of the sequence. 


41. Show that lim infa, < limsup ay. 
42. Prove that if, for alln, a, >0andb, > 0, then 


lim sup [a, - b,| < (limsup a,,) - (limsup },), 


provided the product on the right is not of the form 0 - oo. 


43. Show that every real sequence has a monotone subsequence. Use this to provide another 
proof of the Bolzano-Weierstrass Theorem. 


44. Let p be a natural number greater than 1, and x a real number, 0 < x < 1. Show that there is 
a sequence {a,,} of integers with 0 < a, < p for each n such that 


.-$4 
na P™ 


and that this sequence is unique except when x is of the form q/p", in which case there are 
exactly two such sequences. Show that, conversely, if {a,} is any sequence of integers with 
0 < a, < p, the series 

ie, ¢) An 


n=l p” 


converges to a real number x with 0 < x < 1. If p = 10, this sequence is called the decimal 
expansion of x. For p = 2 it is called the binary expansion; and for p = 3, the ternary 
expansion. 


45. Prove Proposition 20. 
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46. Show that the assertion of the Bolzano-Weierstrass Theorem is equivalent to the Complete- 
ness Axiom for the real numbers. Show that the assertion of the Monotone Convergence 
Theorem is equivalent to the Completeness Axiom for the real numbers. 


1.6 CONTINUOUS REAL-VALUED FUNCTIONS OF A REAL VARIABLE 


Let f be a real-valued function defined on a set E of real numbers. We say that f is 
continuous at the point x in E provided that for each € > 0, there is a 6 > 0 for which 


if x’ € E and |x’ — x| <6, then | f(x’) — f(x)| <e. 


The function f is said to be continuous (on £) provided it is continuous at each point in its 
domain E. The function f is said to be Lipschitz provided there is a c > 0 for which 


f(x’) — f(x)| < c+ |x’ — x| for all x’, x € E. 


It is clear that a Lipschitz functon is continuous. Indeed, for a number x € E and any € > 0, 
6 = e/c responds to the ¢ challenge regarding the criterion for the continuity of f at x. Not 
all continuous functions are Lipschitz. For example, if f(x) = ,/x for 0 < x < 1, then f is 
continuous on [0, 1] but is not Lipschitz. 

We leave as an exercise the proof of the following characterization of continuity at a 
point in terms of sequential convergence. 


Proposition 21 A real-valued function f defined on a set E of real numbers is continuous 
at the point x, € E if and only if whenever a sequence {x,} in E converges to x,, its image 
sequence { f(x, )} converges to f (xx). 


We have the following characterization of continuity of a function on all of its domain. 


Proposition 22 Let f be a real-valued function defined on a set E of real numbers. Then f is 
continuous on E if and only if for each open set O, 


f-}(O) = ENU where U is an open set. (9) 


Proof First assume the inverse image under f of any open set is the intersection of the 
domain with an open set. Let x belong to E. To show that f is continuous at x, let « > 0. The 
interval J = (f(x) —e, f(x) + €) is an open set. Therefore there is an open set U/ such that 


f(D) = (vl ek| f(x) -e< f(x) <f(x) +e} = ENU. 
In particular, f(E MU) CJ and x belongs to EN U. Since U is open there is a 6 > 0 such 
that (x — 6, x + 6) CU. Thus if x’ € E and |x’ — x| <6, then | f(x’) — f(x)| <e. Hence f is 
continuous at x. 
Suppose now that f is continuous. Let © be an open set and x belong to f~!(O). Then 
f(x) belongs to the open set O so that there is an € > 0, such that ( f(x) —e, f(x) +6) CO. 


Since f is continuous at x, there is a 6 > 0 such that if x’ belongs to E and |x’ — x| <6, then 
f(x’) — f(x)| <. Define 1, = (x — 6,x +6). Then f(ENI,) CO. Define 


u= |) &. 


xe f-!(O) 
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Since U is the union of open sets it is open. It has been constructed so that (9) holds. L 


The Extreme Value Theorem A continuous real-valued function on a nonempty closed, 
bounded set of real numbers takes a minimum and maximum value. 


Proof Let f be a continuous real-valued function on the nonempty closed bounded set E of 
real numbers. We first show that f is bounded on E, that is, there is a real number M such that 


[f(x)| < M for all xe E. (10) 


Let x belong to E. Let 6 > 0 respond to the « = 1 challenge regarding the criterion for 
continuity of f at x. Define I, = (x — 6, x + 6). Therefore if x’ belongs to EN J,, then 
| f(x’) — f(x)| <1 and so | f(x’)| < |f(x)| +1. The collection {1,},<¢¢ is an open cover of 
E. The Heine-Borel Theorem tells us that there are a finite number of points {x),..., xn} 
in E such that {J,,}7_, also covers E. Define M = 1 + max{|f(x1)|,..., |f(%n)|}. We claim 
that (10) holds for this choice of E. Indeed, let x belong to E. There is an index k such that 
x belongs to J,, and therefore | f(x)| < 1+ |f(x)| < M. To see that f takes a maximum 
value on E, define m = sup f(E). If f failed to take the value m on E, then the function 
x+>1/( f(x) —m), x € Eis a continuous function on E which is unbounded. This contradicts 
what we have just proved. Therefore f takes a maximum value of E. Since — f is continuous, 
— f takes a maximum value, that is, f takes a minimum value on E. L 


The Intermediate Value Theorem Let f be a continuous real-valued function on the closed, 
bounded interval [a, b] for which f(a) <c < f(b). Then there is a point xq in (a, b) at which 
f (xo) =c. 


Proof We will define by induction a descending countable collection {[a,, b,]}P°., of closed 
intervals whose intersection consists of a single point x9 € (a, b) at which f(x) =c. Define 
a, = a and b; = b. Consider the midpoint m, of [a;, b)]. If c < f(m,), define az = a, and 
bo = my. If f(m,) = c, define ay = m, and bp = by. Therefore f(a2) < c < f(b2) and 
by — a2 = [by — a; |/2. We inductively continue this bisection process to obtain a descending 
collection {[a,, by ]}P°., of closed intervals such that 


f (an) <c < f (by) and by, — a, = [b —a]/2"~! for all n. (11) 


According to the Nested Set Theorem, (\°°,[an, bn] is nonempty. Let xo belong to 
N22, [an, bn]. Observe that 


lan — X0| < Dn — Gn = [b — a}/2"~' for all n. 


Therefore {a,}—> xo. By the continuity of f at xo, {f(an)}—> f(x). Since f(a,) < c for 
all n, and the set (—oo, c] is closed, f(xo) < c. By a similar argument, f(x) > c. Hence 
f(xo) =c. O 


Definition A real-valued function f defined ona set E of real numbers is said to be uniformly 
continuous provided for each € > 0, there is a 5 > 0 such that for all x, x' in E, 


if |x — x'| <6, then | f(x) — f(x’)| <e. 
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Theorem 23 A continuous real-valued function on a closed, bounded set of real numbers is 
uniformly continuous. 


Proof Let f be a continuous real-valued function on a closed bounded set E of real 
numbers. Let « > 0. For each x € E, there is a 6, > 0 such that if x’ € E and |x’ — x| < 5,, then 
|f(x’) — f(x)| </2. Define I, to be the open interval (x — 6,/2, x + 6,/2). Then {I,},e 5 
is an open cover of E. According to the Heine-Borel Theorem, there is a finite subcollection 
{I,,,--+, 1, } which covers E. Define 


6= ) min{6,,, wang 6,,}. 
We claim that this 6 > 0 responds to the € > 0 challenge regarding the criterion for f to 
be uniformly continuous on E. Indeed, let x and x’ belong to E with |x — x’| < 6. Since 
{I,,,..-, [,,} covers E, there is an index k for which |x — x,| <6,,/2. Since |x—x'| <6 < 6;,/2, 


|x! — xx] < |x’ — x] + [x — xg] < 6,,/2 + 8y,/2 = Sy. 


By the definition of 5,,, since |x — x,| <6, and |x’ — x;| <6,, we have | f(x) — f(xz)| <«/2 
and | f(x’) —. f(x, )| < ¢/2. Therefore 


f(x) — F(2) S F(x) — F(x) 4 IFC’) — Fe) </2 +6/2 =e. O 


Definition A real-valued function f defined ona set E of real numbers is said to be increasing 
provided f(x) < f(x’) whenever x, x’ belong to E and x < x’, and decreasing provided — f 
is increasing. It is called monotone if it is either increasing or decreasing. 


Let f be a monotone real-valued function defined on an open interval J that contains 
the point x9. We infer from the Monotone Convergence Theorem for Sequence for Real 
Sequences that if {x,} is a sequence in 1M (xg, 00) which converges to xo, then the sequence 
{ f (Xn )} converges to a real number and the limit is independent of the choice of sequence 
{xn}. We denote the limit by f(xj ). Similarly, we define f (xp ). Then clearly f is continuous 
at xo if and only if f(x9 ) = f(x0) = f(x@ ). If f fails to be continuous at xo, then the only 
point of the image of f that lies between f(x) and f(x ) is f(x) and f is said to have a 
jump discontinuity at xo. Thus, by the Intermediate Value Theorem, a monotone function 
on an open interval is continuous if and only if its image is an interval (see Problem 55). 


PROBLEMS 


47. Let E be a closed set of real numbers and f a real-valued function that is defined and 
continuous on E£. Show that there is a function g defined and continuous on all of R such that 
f(x) = g(x) for each x € E. (Hint: Take g to be linear on each of the intervals of which R ~ E 
is composed.) 


48. Define the real-valued function f on R by setting 
x if x irrational 
f(x) = 


psin , if x = £ in lowest terms. 


At what points is f continuous? 
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49. 


50. 


51. 


57. 


58. 


59. 


60. 
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Let f and g be continuous real-valued functions with a common domain E. 
(i) Show that the sum, f + g, and product, fg, are also continuous functions. 


(ii) If h is a continuous function with image contained in E, show that the composition f oh 
is continuous. 


(iii) Let max{f, g} be the function defined by max{f, g}(x) = max{f(x), g(x)}, for xe E. 
Show that max{ f, g} is continuous. 


(iv) Show that | f| is continuous. 


Show that a Lipschitz function is uniformly continuous but there are uniformly continuous 
functions that are not Lipschitz. 


A continuous function ¢ on [a, b] is called piecewise linear provided there is a partition 
a= xy <x <---<x, =boef [a, b] for which g is linear on each interval [x;, x;41]. Let f bea 
continuous function on [a, b] and € a positive number. Show that there is a piecewise linear 
function ¢ on [a, b] with | f(x) — ¢(x)| < for all x €[a, Db]. 


. Show that a nonempty set E of real numbers is closed and bounded if and only if every 


continuous real-valued function on E takes a maximum value. 


. Show that a set E of real numbers is closed and bounded if and only if every open cover of E 


has a finite subcover. . 


. Show that a nonempty set E of real numbers is an interval if and only if every continuous 


real-valued function on E has an interval as its image. 


. Show that a monotone function on an open interval is continuous if and only if its image is an 


interval. 


. Let f be a real-valued function defined on R. Show that the set of points at which f is 


continuous 1s a Gs set. 


Let {f,} be a sequence of continuous functions defined on R. Show that the set of points x 
at which the sequence {f,,(x)} converges to a real number is the intersection of a countable 
collection of F,, sets. 


Let f be a continuous real-valued function on R. Show that the inverse image with respect to 
f of an open set is open, of a closed set is closed, and of a Borel set is Borel. 


A sequence { f,,} of real-valued functions defined on a set E is said to converge uniformly on 
E to a function f if given € > 0, there is an N such that for all x € E and all n > N, we have 
lfn(x) — f(x)| <e. Let {f,} be a sequence of continuous functions defined on a set E. Prove 
that if { f,} converges uniformly to f on E, then f is continuous on E. 


Prove Proposition 21. Use this proposition and the Bolzano-Weierstrass Theorem to provide 
another proof of the Extreme Value Theorem. 
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2.1 INTRODUCTION 


The Riemann integral of a bounded function over a closed, bounded interval is defined 
using approximations of the function that are associated with partitions of its domain 
into finite collections of subintervals. The generalization of the Riemann integral to the 
Lebesgue integral will be achieved by using approximations of the function that are 
associated with decompositions of its domain into finite collections of sets which we call 
Lebesgue measurable. Each interval is Lebesgue measurable. The richness of the collection 
of Lebesgue measurable sets provides better upper and lower approximations of a function, 
and therefore of its integral, than are possible by just employing intervals. This leads to a 
larger class of functions that are Lebesgue integrable over very general domains and an 
integral that has better properties. For instance, under quite general circumstances we will 
prove that if a sequence of functions converges pointwise to a limiting function, then the 
integral of the limit function is the limit of the integrals of the approximating functions. 
In this chapter we establish the basis for the forthcoming study of Lebesgue measurable 
functions and the Lebesgue integral: the basis is the concept of measurable set and the 
Lebesgue measure of such a set. 7 

The length 2(7) of an interval J is defined to be the difference of the endpoints of I 
if J is bounded, and ov if J is unbounded. Length is an example of a set function, that is, a 
function that associates an extended real number to each set in a collection of sets. In the 
case of length, the domain is the collection of all intervals. In this chapter we extend the set 
function length to a large collection of sets of real numbers. For instance, the “length” of an 
open set will be the sum of the lengths of the countable number of open intervals of which 
it is composed. However, the collection of sets consisting of intervals and open sets is still 
too limited for our purposes. We construct a collection of sets called Lebesgue measurable 
sets, and a set function of this collection called Lebesgue measure which is denoted by m. 
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The collection of Lebesgue measurable sets is a c-algebra! which contains all open sets and 
all closed sets. The set function m possesses the following three properties. 


The measure of an interval is its length Each nonempty interval I is Lebesgue mea- 
surable and 


m(I) = (1). 


Measure is translation invariant If E is Lebesgue measurable and y is any number, then 
the translate of E by y, E+ y = {x + y|x€ E}, also is Lebesgue measurable and 


m(E+y)=m(E). 


Measure is countably additivity over countable disjoint unions of sets” If {Ex}po, is a 
countable disjoint collection of Lebesgue measurable sets, then 


n(C e = S m(E,). 
k=1 


k=1 


It is not possible to construct a set function that possesses the above three properties 
and is defined for all sets of real numbers (see page 48). In fact, there is not even a set function 
defined for all sets of real numbers that possesses the first two properties and is finitely 
additive (see Theorem 18). We respond to this limitation by constructing a set function on a 
very rich class of sets that does possess the above three properties. The construction has two 
stages. 

We first construct a set function called outer-measure, which we denote by m*. It 
is defined for any set, and thus, in particular, for any interval. The outer measure of an 
interval is its length. Outer measure is translation invariant. However, outer measure is not 
finitely additive. But it is countably subadditive in the sense that if {Ex}7°, is any countable 
collection of sets, disjoint or not, then 


m(O ® | < S m*( Ex). 
k=1 k=1 


The second stage in the construction is to determine what it means for a set to be Lebesgue 
measurable and show that the collection of Lebesgue measurable sets is a o-algebra 
containing the open and closed sets. We then restrict the set function m* to the collection 
of Lebesgue measurable sets, denote it by m, and prove m is countably additive. We call m 
Lebesgue measure. 


‘A collection of subsets of R is called a o-algebra provided it contains R and is closed with respect to the 
formation of complements and countable unions; by De Morgan’s Identities, such a collection is also closed with 
respect to the formation of countable intersections. 

2For a collection of sets to be disjoint we mean what is sometimes called pairwise disjoint, that is, that each pair 
of sets in the collection has empty intersection. 
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PROBLEMS 


In the first three problems, let m be a set function defined for all sets in a o-algebra A with values 
in [0, co]. Assume m is countably additive over countable disjoint collections of sets in A 


1. Prove that if A and B are two sets in A with A C B, then m(A) < m(B). This property is 
called monotonicity. 


2. Prove that if there is a set A in the collection A for which m( A) < oo, then m(@) = 0. 

3. Let {E;,}?2, be a countable collection of sets in A. Prove that m(Ue, Ex) < De, m( Ex). 

4. A set function c, defined on all subsets of R, is defined as follows. Define c(E) to be oo if 
E has infinitely many members and c(E) to be equal to the number of elements in E if E 


is finite; define c(@) = 0. Show that c is a countably additive and translation invariant set 
function. This set function is called the counting measure. 


2.2 LEBESGUE OUTER MEASURE 


Let J be a nonempty interval of real numbers. We define its length, £(J), to be oo if J 1s 
unbounded and otherwise define its length to be the difference of its endpoints. For a set 
A of real numbers, consider the countable collections {J,}?2., of nonempty open, bounded 
intervals that cover A, that is, collections for which A C Uj? J. For each such collection, 
consider the sum of the lengths of the intervals in the collection. Since the lengths are positive 
numbers, each sum is uniquely defined independently of the order of the terms. We define 
the outer measure” of A, m*(A), to be the infimum of all such sums, that is 


m*(A) = inf 3 (I) | A CU i} 
k=1 k=1 


It follows immediately from the definition of outer measure that m*(@) = 0. Moreover, since 
any cover of a set B is also a cover of any subset of B, outer measure is monotone in the 
sense that 


if AC B, then m*(A) <m*(B). 


Example A countable set has outer measure zero. Indeed, let C be a countable set 
enumerated as C = {c,}°°,. Lete>0. For each natural number k, define I, = (cx—€/2°", cx+ 
¢/2'*1), The countable collection of open intervals {1;}°°, covers C. Therefore 


0<m*(C)< S l(k) = S e/2* =e. 
k=1 k=1 


This inequality holds for each « > 0. Hence m*( E) = 0. 


Proposition 1 The outer measure of an interval is its length. 


3There is a general concept of outer measure, which will be considered in Part III. The set function m* is a 
particular example of this general concept, which is properly identified as Lebesgue outer measure on the real line. 
In Part I, we refer to m* simply as outer measure. 


32 Chapter2 Lebesgue Measure 


Proof We begin with the case of a closed, bounded interval [a, b]. Let « > 0. Since the open 
interval (a—¢, b+) contains [a, b] we have m*([a, b]) < £((a—€, b+€)) = b—a+2e. This 
holds for any « > 0. Therefore m*([a, b]) < b—a. It remains to show that m*([a, b]) > b—a. 
But this is equivalent to showing that if {J,}°°, is any countable collection of open, bounded 
intervals covering [a, b], then 

CO 

SY lh) = b= a. (1) 

k=1 . 
By the Heine-Borel Theorem,’ any collection of open intervals covering [a, b] has a finite 
subcollection that also covers [a, b]. Choose a natural number n for which {I}p_, covers 
[a, b]. We will show that 

n 

> hk) = b=, (2) 

k=1 
and therefore (1) holds. Since a belongs to U?_, Jz, there must be one of the J;,’s that contains 
a. Select such an interval and denote it by (a, b,). We have a, <a < b,. If b; > b, the 
inequality (2) is established since 


n 
Sy lk) > bi —a, > b—a. 
k=1 


Otherwise, b; €[a, b), and since b; ¢ (a, b; ), there is an interval in the collection (Tk }e_ 
which we label (ap, bz), distinct from (a1, b;), for which b, € (ap, b>); that is, az < by < by. 
If bp => b, the inequality (2) is established since 


n 
Sd Uk) = (bi — a1) + (bp — a2) = bn — (a — 1) a, >) — a, > 
k=1 


We continue this selection process until it terminates, as it must since there are only n 
intervals in the collection {1,};_1. Thus we obtain a subcollection {(ax, by a of {I,}7_, for 
which 


ay <a, 
while 
Qa, <b for 1 <k < N—1, 


and, since the selection process terminated, 
by > b. 


Thus 
n N 


2 Ee) =X e((ai,bi)) 
= (by — aw) + (by-1 — ay-1) +++ + (bi — a1) 
= by — (ay — by_1) -...- (a) -b1) — a1 
> by -— a, >b-a. 


4See page 18. 
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Thus the inequality (2) holds. 


If J is any bounded interval, then given € > 0, there are two closed, bounded intervals 
J; and Jp such that 
A CICH 


while 
(1) —€ <e(4,) and (Jy) < £(7) +. 


By the equality of outer measure and length for closed, bounded intervals and the mono- 
tonicity of outer measure, 


(I) —€< (J) =m*() <m*(1) <m*() =U) <L(1) +6. 


This holds for each € > 0. Therefore 2(7) = m*(J). 


If J is an unbounded interval, then for each natural number 7, there is an interval J C J 
with £(J) =n. Hence m*(I) > m*(J) = €( J) =n. This holds for each natural number n. 
Therefore m*(I) = ov. | O 


Proposition 2 Outer measure is translation invariant, that is, for any set A and number y, 
m*(A+y)=m*(A). 


Proof Observe that if {J,}?0, is any countable collection of sets, then {J,}7° , covers A if and 
only if {J + y}72, covers A + y. Moreover, if each J, is an open interval, then each J; + y is 
an open interval of the same length and so 


Oo CO 


> he) = > Ue + y). 
k=1 k=1 
The conclusion follows from these two observations. LO 


Proposition 3 Outer measure is countably subadditive, that is, if {E,}°°., is any countable 
collection of sets, disjoint or not, then 


m(O ay < S m*( Ex). 
k=1 k=1 


Proof If one of the E;’s has infinite outer measure, the inequality holds trivially. We 
therefore suppose each of the E;’s has finite outer measure. Let ¢ > 0. For each natural 
number k, there is a countable collection {J;,;}?°, of open, bounded intervals for which 


ee) 0° 
Ex cU I, and S l(Ik.i) < m*( Ex) + €/2*. 
i=1 i=1 


Now {J;i}1<k,i<oo is a countable collection of open, bounded intervals that covers Ure, Ex: 
the collection is countable since it is a countable collection of countable collections. Thus, 
by the definition of outer measure, 
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“(0 & | < > ki) = > Pra) 
k=1 = 


1<k,i<oo k= 
CO 
<> Im* (Ex) + «/2 
k=1 
CO 
=| > m*(Ex)| +. 
k=1 
Since this holds for each € > 0, it also holds for « = 0. The proof is complete. L 


If {E;};_, is any finite collection of sets, disjoint or not, then 
n n 
m* (_) Ex < S) m*( Ex). 
k=1 k=1 


This finite subadditivity property follows from countable subadditivity by taking E, = @ 
fork >n. 


PROBLEMS 
5. By using properties of outer measure, prove that the interval [0, 1] is not countable. 
6. Let A be the set of irrational numbers in the interval [0, 1]. Prove that m*(A) = 1. 


7. A set of real numbers is said to be a Gs set provided it is the intersection of a countable 
collection of open sets. Show that for any bounded set E, there is a Gs set G for which 


ECG and m*(G) =m*(E). 


8. Let B be the set of rational numbers in the interval [0, 1], and let {J,}"_, be a finite collection 
of open intervals that covers B. Prove that )7_, m*(I,) > 1. 
9. Prove that if m*(A) = 0, then m*(A U B) = m*(B). 


10. Let A and B be bounded sets for which there is an a>0 such that |a—b| > aforallac A, bE B. 
Prove that m*(A U B) = m*(A)+m*(B). 


2.3 THE o-ALGEBRA OF.LEBESGUE MEASURABLE SETS 


Outer measure has four virtues: (i) it is defined for all sets of real numbers, (ii) the outer 
measure of an interval is its length, (iii) outer measure is countably subadditive, and (iv) 
outer measure is translation invariant. But outer measure fails to be countably additive. In 
fact, it is not even finitely additive (see Theorem 18): there are disjoint sets A and B for 
which 


m*(AU B) <m*(A)+m*(B). (3) 
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To ameliorate this fundamental defect we identify a o-algebra of sets, called the 
Lebesgue measurable sets, which contains all intervals and all open sets and has the property 
that the restriction of the set function outer measure to the collection of Lebesgue measurable 
sets is countably additive. There are a number of ways to define what it means for a set to be 
measurable.” We follow an approach due to Constantine Carathéodory. 


Definition A set E is said to be measurable provided for any set A,° 
m*(A) =m*(ANE)+m*(ANE*). 


We immediately see one advantage possessed by measurable sets, namely, that the 
strict inequality (3) cannot occur if one of the sets is measurable. Indeed, if, say, A is 
measurable and B is any set disjomt from A, then 

m*(AUB) =m*([AU B]N A)) +m*([AU B)]M AC) = m*(A) + m*(B). 


Since, by Proposition 3, outer measure is finitely subadditive and A = [ANE]U[ANE‘], 
we always have 
m*(A) <m*(AN E)+m*(ANE‘). 
Therefore E is measurable if and only if for each set A we have 
m*(A) > m*(ANE)+m*(ANE*). (4) 


This inequality trivially holds if m*(A) = oo. Thus it suffices to establish (4) for sets A that 
have finite outer measure. 

Observe that the definition of measurability is symmetric in E and E°, and therefore 
a set is measurable if and only if its complement is measurable. Clearly the empty-set # and 
the set R of all real numbers are measurable. 


Proposition 4 Any set of outer measure zero is measurable. In particular, any countable set 
is measurable. 


Proof Let the set E have outer measure zero. Let A be any set. Since 
ANECEand AN ES CA, 
by the monotonicity of outer measure, 
m*(AME) <m*(E) =0and m*(AN E°) < m*(A). 
Thus, 
m*(A) > m*(AN E©) =0+m*(AN E°) = m*(ANE) +m*(ANES), 
and therefore EF is measurable. L 


>We should fully identify what we here call a measurable set as a Lebesgue measurable subset of the real line. A 
more general concept of measurable set will be studied in Part III. However, there will be no confusion in the first 
part of this book in simply using the adjective measurable. 

Recall that for a set E, by E° we denote the set {x € R| x ¢ E}, the complement of E in R. We also denote E© by 
R~ E. More generally, for two sets A and B, we let A ~ B denote {a € A | x ¢ B} and call it the relative complement 
of Bin A. 
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Proposition 5 The union of a finite collection of measurable sets is measurable. 


Proof As a first step in the proof, we show that the union of two measurable sets FE, and E> 
is measurable. Let A be any set. First using the measurability of E,, then the measurability 
of E>, we have 


m*(A) = m*(ANE;) +m*(AN ES) 


= m*( ANE, )+m*([AN Ey |NE2)+m*([ANECINES ). 
There are the following set identities: 
[AN E{]N ES = AN[E, U Ey]° 


and 
[AN E,\]U[AN Ef N Ey] = AN[E, U EY]. 
We infer from these identities and the finite subadditivity of outer measure that 


m*(A) = m*(AN Ey) + m*([AN Ef]N Ez) +m*([AN EC] N ES) 
= m*(AN E,)+m*([AN EC]N Ey) +m*(AN[E, U Ey]°) 


> m*(AN[E, U Ey]) +m*(AN[E, U Ey]°). 
Thus EF) U E> is measurable. 


Now let {E,};_, be any finite collection of measurable sets. We prove the measurability 
of the union (?_, Ex, for general n, by induction. This is trivial for n = 1. Suppose it is true 
for n — 1. Thus, since 

n n—1 
J E. = Unive, 
k=1 k=1 
and we have established the measurability of the union of two measurable sets, the set 
U;_, Ex 1s measurable. Cl 


Proposition 6 Let A be any set and {Ex};_, 4 finite disjoint collection of measurable sets. 


Then 
m* [4 a 


m(U F | = ¥ m*(E;). 
k=1 k=1 


UL ay = s m*(AN Ex). 


k=1 k=1 
In particular, 


Proof The proof proceeds by induction on n. It is clearly true for n = 1. Assume it is true 
for n — 1. Since the collection {E,}7_, 1s disjoint, 


AN 


n 
UniJoe=ane, 
k=1 
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k=1 


A n|U]res= ana] 


Hence, by the measurability of E,, and the induction assumption, 


n n—-1 
m* [4 N}|U Ex = m*(ANE,) +m* [4 ae E; | 
k=1 k=1 
n—1 
=m*(ANE,)+ > m*(AN Ex) 
k=1 
= m*(AN Ex). 
2 4 


A collection of subsets of R is called an algebra provided it contains R and is closed 
with respect to the formation of complements and finite unions; by De Morgan’s Identities, 
such a collection is also closed with respect to the formation of finite intersections. We infer 
from Proposition 5, together with the measurability of the complement of a measurable set, 
that the collection of measurable sets is an algebra. It is useful to observe that the union of 
a countable collection of measurable sets is also the union of a countable disjoint collection 
of measurable sets. Indeed, let {A;,}?2., be a countable collection of measurable sets. Define 
A, = A, and for each k > 2, define 


k-1 
Ay = Ax~ (U Ai. 
~ 


Since the collection of measurable sets is an algebra, {A/,}7°, is a disjoint collection of 
measurable sets whose union is the same as that of {Ax}7°). 


Proposition 7 The union of a countable collection of measurable sets is measurable. 


Proof Let E be the union of acountable collection of measurable sets. As we observed above, 
there is a countable disjoint collection of measurable sets {E,}7°, for which E = U72, Ex. 
Let A be any set. Let n be a natural number. Define F,, = Uj_, Ex. Since F;, is measurable 
and FD ES, 


m*(A) =m*(AN Fx) +m*(AN FC) > m*(AN F,) +m*(ANE*). 


By Proposition 6, 
m*(AN F,) = > m*(AN Ex). 
k=1 
Thus 


m*(A) > s m*(AN Ex) +m*(ANE‘). 
k=l 
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The left-hand side of this inequality is independent of n. Therefore 
oO 

m*(A)> Si m*(AN Ey) + m*(A 0 E°) | 
k=1 


Hence, by the countable subadditivity of outer measure, 
m*(A) >m*(ANE) + m*(A n Eo) | 
Thus E is measurable. U 


A collection of subsets of R is called an o-algebra provided it contains R and is 
closed with respect to the formation of complements and countable unions: by De Morgan’s 
Identities, such a collection is also closed with respect to the formation of countable 
intersections. The preceding proposition tells us that the collection of measurable sets is a 
o-algebra. 


Proposition 8 Every interval is measurable. 


Proof As we observed above, the measurable sets are a a-algebra. Therefore to show that 
every interval is measurable it suffices to show that every interval of the form (a, 00) is 
measurable (see Problem 11). Consider such an interval. Let A be any set. We assume a does 
not belong to A. Otherwise, replace A by A ~ {a}, leaving the outer measure unchanged. We 
must show that 


m™(A,) +m*(A2) < m*(A), (5) 


where 
A; = AN (—o, a) and Az = AN (a, 0). 


By the definition of m*( A) as an infimum, to verify (5) it is necessary and sufficient to show 
that for any countable collection {I,}7°., of open, bounded intervals that covers A, 

oO 

m*(A,) + m*(A2) < S Uk). (6) 

k=1 

Indeed, for such a covering, for each index k, define 
I, = I, (-00, a) and I = i, N (a, 00) 
Then J, and J/ are intervals and 
(I) = €(T,) + (7). 


Since {I,}7°., and {I7}°, are countable collections of open, bounded intervals that cover A; 
and Ap, respectively, by the definition of outer measure, 


m*(A,) < S (J,,) and m*( Az) < s (I, ). 
k=1 k=1 
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Therefore 
OO OO 
m*(Ai)+m*(A1) < ¥ eh) + & CY), 
k=1 k=1 
a / Hf 
= et) + £(1/)] 
OO 
=> €( Ik). 
k=1 
Thus (6) holds and the proof is complete. U 


Every open set is the disjoint union of a countable collection of open intervals.’ We 
therefore infer from the two preceding propositions that every open set is measurable. Every 
closed set is the complement of an open set and therefore every closed set is measurable. 
Recall that a set of real numbers is said to be a Gg set provided it is the intersection of 
a countable collection of open sets and said to be an F, set provided it is the union of a 
countable collection of closed sets. We infer from Proposition 7 that every Gs set and every 
F, set is measurable. 

The intersection of all the o-algebras of subsets of R that contain the open sets is a 
o-algebra called the Borel o-algebra; members of this collection are called Borel sets. The 
Borel o-algebra is contained in every o-algebra that contains all open sets. Therefore, since 
the measurable sets are a o-algebra containing all open sets, every Borel set is measurable. 
We have established the following theorem. 


Theorem 9 The collection M of measurable sets is a c-algebra that contains the o-algebra 
B of Borel sets. Each interval, each open set, each closed set, each Gs set, and each F,, set is 
measurable. 


Proposition 10 The translate of a measurable set is measurable. 


Proof Let E be a measurable set. Let A be any set and y be a real number. By the 
measurability of E and the translation invariance of outer measure, 


m*(A) = m*(A—y) =m*([A— y]NE) +m*([A ~y}n E) 


= m*(AN[E+ y]) +m*(A N[E+ yI°). 
Therefore E + y is measurable. UO 


PROBLEMS 


11. Prove that if a o-algebra of subsets of R contains intervals of the form (a, oo), then it contains 
all intervals. 


12. Show that every interval is a Borel set. 


See page 17. 
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13. Show that (i) the translate of an F, set is also F,, (ii) the translate of a Gs set is also Gs, and 
(iii) the translate of a set of measure zero also has measure zero. 


14. Show that if a set E has positive outer measure, then there is a bounded subset of E that also 
has positive outer measure. 


15. Show that if E has finite measure and ¢ > 0, then E is the disjoint union of a finite number of 
measurable sets, each of which has measure at most e. 


2.4 OUTER AND INNER APPROXIMATION OF LEBESGUE MEASURABLE SETS 


We now present two characterizations of measurability of a set, one based on inner approx- 
imation by closed sets and the other on outer approximation by open sets, which provide 
alternate angles of vision on measurability. These characterizations will be essential tools for 
our forthcoming study of approximation properties of measurable and integrable functions. 

Measurable sets possess the following excision property: If A is a measurable set of 
finite outer measure that is contained in B, then 


m*(B~ A) =m*(B)—m*(A). (7) 
Indeed, by the measurability of A, 
m*(B) = m*(BN A) +m*(BN A°) =m*(A)+m*(B~A), 


and hence, since m*( A) < oo, we have (7). 


Theorem 11 Let E be any set of real numbers. Then each of the following four assertions is 
equivalent to the measurability of E. 
(Outer Approximation by Open Sets and Gs Sets) 


(i) For each e > 0, there is an open set O containing E for which m*(O~ E) <e. 
(ti) There is a Gs set G containing E for which m*(G~ E) = 0. 
(Inner Approximation by Closed Sets and F,, Sets) 


(iii) For each € > 0, there is a closed set F contained in E for which m*(E~ F) <e. 
(iv) There is an F, set F contained in E for which m*(E~ F) = 0. 


Proof We establish the equivalence of the measurability of E with each of the two outer 
approximation properties (i) and (ii). The remainder of the proof follows from De Morgan’s 
Identities together with the observations that a set is measurable if and only if its complement 
is measurable, is open if and only if its complement is closed, and is F, if and only if its 
complement is Gs. 


Assume E is measurable. Let € > 0. First consider the case that m*( E) < oo. By the 
definition of outer measure, there is a countable collection of open intervals {1}7°, which 
covers E and for which 


S l( Ik) <m*(E) +e. 
k=1 


Define O = U7? J;. Then O is an open set containing E. By the definition of the outer 
measure of O, 


m*(Q) < S e(I,) <m*(E) +6, 
k=1 
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so that 
m*(O) — m*(E) <e. 


However, E is measurable and has finite outer measure. Therefore, by the excision property 
of measurable sets noted above, 


m*(O~ E) =m*(O) —m*(E) <e. 


Now consider the case that m*( E) = oo. Then E may be expressed as the disjoint union of 
a countable collection {E,}°, of measurable sets, each of which has finite outer measure. 
By the finite measure case, for each index k, there is an open set O; containing FE; for which 
m*( Ox ~ Ex) < €/2*. The set O =, O; is open, it contains E and 


0O 0O 
O~E=|JO.~EC\|J[Ox ~ Ex]. 
k=1 k=1 


Therefore x0 so 
m*(O~E)) < > m*(Oy~ Ex) < > /2* =e. 
k=1 k=1 
Thus property (i) holds for E. 
Now assume property (i) holds for E. For each natural number k, choose an open set 
O that contains E and for which m*(O; ~ E) <1/k. Define G = (7°, Ox. Then G is a Gg set 
that contains E. Moreover, since for each k, G~ EC Oy ~ E, by the monotonicity of outer 


measure, 
m*(G~E) <m*(O,~E) <1/k. 


Therefore m*(G ~~ E) = 0 and so (ii) holds. Now assume property (ii) holds for E. Since a 
set of measure zero is measurable, as is a Gs set, and the measurable sets are an algebra, the 
set 

E=Gn[G~eE]© 


is measurable. CJ 


The following property of measurable sets of finite outer measure asserts that such sets 
are “‘nearly’”’ equal to the disjoint union of a finite number of open intervals. 


Theorem 12 Let E be a measurable set of finite outer measure. Then for each € > 0, there is a 
finite disjoint collection of open intervals {I}, for which if O = Up, Ik. then® 


m*(E~O)+m*(O~E) <e. 
Proof According to assertion (i) of Theorem 11, there is an open set U/ such that 
ECU and m*(U~E) <é/2. (8) 


8For two sets A and B, the symmetric difference of A and B, which is denoted by AAB, is defined to be the set 
[A ~ B] U[B~ A]. With this notation the conclusion is that m*(E AO) <e. 
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Since E is measurable and has finite outer measure, we infer from the excision property 
of outer measure that U/ also has finite outer measure. Every open set of real numbers 
is the disjoint union of a countable collection of open intervals.? Let U/ be the union of 
the countable disjoint collection of open intervals {I,}7°,. Each interval is measurable and 
its outer measure is its length. Therefore, by Proposition 6 and the monotonicity of outer 
measure, for each natural number n, 


S €(Ik) = m(U | < m*(U) < OO. 
k=1 k=1 

The right-hand side of this inequality is independent of n. Therefore 

oO 

S e( I) < OO. 

k=1 
Choose a natural number n for which 

oO 

S e( I) < €/2. 

k=n+1 
Define O = Uj_) Ix. Since O~ E CU ~ E, by the monotonicity of outer measure and (8), 
m*(O~ E) <m*(U~E) <é/2. 
On the other hand, since E CU, 
(,@) 
E~OCU~O= |J k, 
k=n+1 
so that by the definition of outer measure, 
(,@) 
m*(E~O)< > l(h) <é/2. 
k=n+1 
Thus 
m*(O~ E)+m*(E~O) <e. O 


Remark A comment regarding assertion (i) in Theorem 11 is in order. By the definition 
of outer measure, for any bounded set E, regardless of whether or not it is measurable, and 
any € > 0, there is an open set O such that E C O and m*(O) < m*(E) + € and therefore 
m*(O) — m*(E) <e. This does not imply that m*(O ~ E) < €, because the excision property 


m*(O~ E) = m*(O) —m*(E) 
is false unless E is measurable (see Problem 19). 


See page 17. 
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PROBLEMS 


16. Complete the proof of Theorem 11 by showing that measurability is equivalent to (iii) and 
also equivalent to (iv). 


17. Show that a set E is measurable if and only if for each € > 0, there is a closed set F and open 
set O for which F C E CO and m*(O~ F) <e 


18. Let E have finite outer measure. Show that there is an F, set F and a Gs set G such that 


FC ECG and m*(F) = m*(E) =m*(G). 


19. Let E have finite outer measure. Show that if E is not measurable, then there is an | open set 
O containing E that has finite outer measure and for which 


m*(O~ E)>m* (0) —m* (E). 


20. (Lebesgue) Let E have finite outer measure. Show that E is measurable if and only if for each 
open, bounded interval (a, b), 


b—a=m"((a,b)NE)+m*((a, b)~E). 


21. Use property (ii) of Theorem 11 as the primitive definition of a measurable set and prove 
that the union of two measurable sets is measurable. Then do the same for property (iv). 


22. For any set A, define m**( A) €[0, oo] by 
m**(A) = inf {m*(O) | OD A,O open.} 


How is this set function m** related to outer measure m*? 


23. For any set A, define m***(A) €[0, 00] by 
m***(A) = sup {m*(F) | FCA, F closed.} 


How is this set function m*** related to outer measure m*? 


2.5 COUNTABLE ADDITIVITY, CONTINUITY, AND THE BOREL-CANTELLI LEMMA 


Definition The restriction of the set function outer measure to the class of measurable sets 
is called Lebesgue measure. It is denoted by m, so that if E is a measurable set, its Lebesgue 
measure, m( E), is defined by 
m(E)=m*(E). 
The following proposition is of fundamental importance. 


Proposition 13 Lebesgue measure is countably additive, that is, if {E;};°., is a countable 
disjoint collection of measurable sets, then its union \)7~., Ex also is measurable and 


“(0 Bt -¥ m( Ex). 
k=1 k=1 
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Proof Proposition 7 tells us that U?°, Ex is measurable. According to Proposition 3, outer 
measure is countably subadditive. Thus 


“(0 & | < s m( Ex). (9) 
k=1 k=1 


It remains to prove this inequality in the opposite directon. According to Proposition 6, for 
each natural number n, 


o(U | = s m( Ex). 
k=1 k=1 


Since U7? Ex; contains 7_, Ex, by the monotonicity of outer measure and the preceding 
equality, 


o(O & | > s m(E; ) for each n. 


k=1 k=1 
The left-hand side of this inequality is independent of n. Therefore 


“(0 B| > S m( Ex). (10) 
k=l k=l 


From the inequalities (9) and (10) it follows that these are equalities. L 


According to Proposition 1, the outer measure of an interval is its length while 
according to Proposition 2, outer measure is translation invariant. Therefore the preceding 
proposition completes the proof of the following theorem, which has been the principal goal 
of this chapter. 


Theorem 14 The set function Lebesgue measure, defined on the a-algebra of Lebesgue 
measurable sets, assigns length to any interval, is translation invariant, and is countable 
additive. 


A countable collection of sets {E,}?°, is said to be ascending provided for each k, 
Ex © Ex41, and said to be descending provided for each k, Ex.) C Ex. 


Theorem 15 (the Continuity of Measure) Lebesgue measure possesses the following conti- 
nuity properties: 


(t) If {Ax}°°., is an ascending collection of measurable sets, then 


n(C a] = lim m(Ax). (11) 
k=1 k> oo 


(it) If {By}7°, is a descending collection of measurable sets and m( B,) < 00, then 


(A a = im m( Bx). (12) 
kel > 00 
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Proof We first prove (i). If there is an index ko for which m(A,,) = ov, then, by the 
monotonicity of measure, m (I, Ax) = 00 and m( Ax) = 00 for all k > ko. Therefore (11) 
holds since each side equals oo. It remains to consider the case that m(A;) < oo for all k. 
Define Ag = @ and then define C, = Ay ~ Ay_ for each k > 1. By construction, since the 
sequence {Ax}; , is ascending, 


1, @) 1, @) 
{Ci }f2, is disjoint and |_) Ax = (LJ Cx. 
k=1 k=1 
By the countable additivity of m, 
oO 1, @) oO 
m (_) Ax =m (_) Ce = Si m(Ax~ Ax-1). (13) 
k=1 k=1 k=1 


Since {A;}?-, is ascending, we infer from the excision property of measure that 


S m( Ax ~ Ag_1) = x [m( Ax) — m(Ax-1)] 


k=1 
“tim... 3 m( Ae) - m(Ax-1)] 
= limy + oolm(An) — m(Ao)].- 


(14) 


Since m( Ag) = m(@) = 0, (11) follows from (13) and (14). 


To prove (ii) we define D, = B, ~ By for each k. Since the sequence {By}7°, is 
descending, the sequence {D,;}?°, is ascending. By part (i), 


1, @) 
m (G D, = lim m( Dx). 
According to De Morgan’s Identities, 


oo ie) oO 
(_) Dy =| [Bi ~ Be] = Bi~ (-) Be- 
k=1 k=1 k=1 


On the other hand, by the excision property of measure, for each k, since m( By) < 00, 
m( D,) = m(B,) — m( By). Therefore 


n( a ~ Fm) = lim 1 [m( Bi) - m(B,)]. 
k=1 


Once more using excision we obtain the equality (12). O 


For a measurable set E, we say that a property holds almost everywhere on E, or it 
holds for almost all x ¢ E, provided there is a subset Eo of E for which m( Ep) = 0 and the 
property holds for all xe E~ Eo. 
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The Borel-Cantelli Lemma Let {Ex}~-, be a countable collection of measurable sets for 
which Yy2., m( Ex) < 00. Then almost all x € R belong to at most finitely many of the E;,’s. 
Proof For each n, by the countable subadditivity of m, 

oO oO 

m| |) Ex.) < S| m( Ex) < 00. 

k=n k=n 

Hence, by the continuity of measure, 


1, @) 
U & 


“(Alo 


n=1 


(0, @) (0, @) 
| = pti m(L) Ex) < rlim, 2 m(E;,) = 0. 
Therefore almost all x € R fail to belong to ‘alent UR, Ex| and therefore belong to at most 


finitely many E;’s. O 


The set function Lebesgue measure inherits the properties possessed by Lebesgue 
outer measure. For future reference we name some of these properties. 


(Finite Additivity) For any finite disjoint collection { Ex};_, of measurable sets, 
n n 
m\ |_J Ex | = >. m( Ex). 
k=1 k=1 
(Monotonicity) If A and B are measurable sets and A C B, then 
m(A) < m(B). 
(Excision) If, moreover, A C B and m( A) < oo, then 


m( B~ A) = m(B)-—m(A), 


so that if m(A) = 0, then 
m(B~ A) =m/(B). 


(Countable Monotonicity) For any countable collection {E; }2., of measurable sets 
that covers a measurable set E, 


0O 
m(E) < > m(E;). 
k=1 
Countable monotonicity is an amalgamation of the monotonicity and countable sub- 
additivity properties of measure that is often invoked. 
Remark In our forthcoming study of Lebesgue integration it will be apparent that it is the 


countable additivity of Lebesgue measure that provides the Lebesgue integral with its decisive 
advantage over the Riemann integral. 
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PROBLEMS 
24. Show that if E, and E> are measurable, then 


m(E; U E>) + m(E; N E>) = m(E;) +m( E>). 


25. Show that the assumption that m(B,) < oo is necessary in part (ii) of the theorem regarding 
continuity of measure. 


26. Let {Ex}, be a countable disjoint collection of measurable sets. Prove that for any set A, 


m(4 AU Bi = s m*(AN Ex). 


k=1 =1 


27. Let M’ be any o-algebra of subsets of R and m’ a set function on M’ which takes values in 
[0, co], is countably additive, and such that m’(@) = 0. 
(i) Show that m’ is finitely additive, monotone, countably monotone, and possesses the 
excision property. 
(ii) Show that m’ possesses the same continuity properties as Lebesgue measure. 


28. Show that continuity of measure together with finite additivity of measure implies countable 
additivity of measure. 


2.6 NONMEASURABLE SETS 


We have defined what it means for a set to be measurable and studied properties of the 
collection of measurable sets. It is only natural to ask if, in fact, there are any sets that fail to 
be measurable. The answer is not at all obvious. 

We know that if a set E has outer measure zero, then it is measurable, and since any 
subset of EF also has outer measure zero, every subset of E is measurable. This is the best that 
can be said regarding the inheritance of measurability through the relation of set inclusion: 
we now show that if E is any set of real numbers with positive outer measure, then there are 
subsets of E that fail to be measurable. 


Lemma 16 Let E be a bounded measurable set of real numbers. Suppose there is a bounded, 
countably infinite set of real numbers A for which the collection of translates of E, {A+ E}y¢4, 
is disjoint. Then m( E) = 0. 


Proof The translate of a measurable set is measurable. Thus, by the countable additivity of 
measure over countable disjoint unions of measurable sets, 


= ¥) m(A+ E). (15) 


AEA 


"U (A+ E) 


AeA 


Since both E and A are bounded sets, the set U, <4 (A+ £) also is bounded and therefore has 
finite measure. Thus the left-hand side of (15) is finite. However, since measure is translation 
invariant, m(A + E) = m(E) > 0 for each A€ A. Thus, since the set A is countably infinite 
and the right-hand sum in (15) is finite, we must have m(E) = 0. O 
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For any nonempty set E of real numbers, we define two points in E to be rationally 
equivalent provided their difference belongs to Q, the set of rational numbers. It is easy to see 
that this is an equivalence relation, that is, it is reflexive, symmetric, and transitive. We call it 
the rational equivalence relation on E. For this relation, there is the disjoint decomposition 
of E into the collection of equivalence classes. By a choice set for the rational equivalence 
relation on E we mean a set Cz consisting of exactly one member of each equivalence class. 
We infer from the Axiom of Choice!® that there are such choice sets. A choice set Cr is 
characterized by the following two properties: 


(i) the difference of two points in Cg is not rational; 
(ii) for each point x in E, there is a point c in Cg for which x = c + q, with g rational. 


This first characteristic property of Cz may be conveniently reformulated as follows: 
For any set AC Q, {A+Cz}),¢< is disjoint. (16) 


Theorem 17 (Vitali) Any set E of real numbers with positive outer measure contains a subset 
that fails to be measurable. 


Proof By the countable subadditivity of outer measure, we may suppose E is bounded. Let 
Cr be any choice set for the rational equivalence relation on E. We claim that Cz is not 
measurable. To verify this claim, we assume it is measurable and derive a contradiction. 


Let Ag be any bounded, countably infinite set of rational numbers. Since Cr is 
measurable, and, by (16), the collection of translates of Cz by members of Ao is disjoint, it 
follows from Lemma 16 that m(Cz) = 0. Hence, again using the translation invariance and 
the countable additivity of measure over countable disjoint unions of measurable sets, 


m| (J (A+Ce)| = 3 m(AtCz) =0. 
AE Ag AE Ng 


To obtain a contradiction we make a special choice of Ag. Because E is bounded it is 
contained in some interval [—b, b]. We choose 


Ao = [—2b, 2b] NQ. 


Then Ag is bounded, and is countably infinite since the rationals are countable and dense.!! 
We claim that 


EC (J (A+Cz). (17) 
Ae [—2b, 2b]IN@ 


Indeed, by the second characteristic property of Cz, if x belongs to E, there is a number c in 
the choice set Cz for which x = c + q with q rational. But x and c belong to [—b, b], so that g 
belongs to [—2b, 2b]. Thus the inclusion (17) holds. This is a contradiction because E, a set 
of positive outer measure, is not a subset of a set of measure zero. The assumption that Cr 
is measurable has led to a contradiction and thus it must fail to be measurable. L] 


10See page 5. 
‘I See pages 12 and 14. 
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Theorem 18 There are disjoint sets of real numbers A and B for which 
—m*(AUB) <m*(A) +m*(B). 


Proof We prove this by contradiction. Assume m*(A U B) = m*(A) + m*(B) for every 
disjoint pair of sets A and B. Then, by the very definition of measurable set, every set must 
be measurable. This contradicts the preceding theorem. O 


PROBLEMS 
29. (i) Show that rational equivalence defines an equivalence relation on any set. 
(ii) Explicitly find a choice set for the rational equivalence relation on Q. 


(iii) Define two numbers to be irrationally equivalent provided their difference is irrational. 
Is this an equivalence relation on R? Is this an equivalence relation on Q? 


30. Show that any choice set for the rational equivalence relation on a set of positive outer 
measure must be uncountably infinite. 


31. J ustify the assertion in the proof of Vitali’s Theorem that it sulfices to consider the case that 
E is bounded. | 


32. Does Lemma 16 remain true if A is allowed to be finite or to be uncountably infinite? Does it 
remain true if A is allowed to be unbounded? 


33. Let E be a nonmeasurable set of finite outer measure. Show that there is a Gs set G that 


contains E for which 
| m*( E) =m*(G), while m*(G~E) >0. 


2.7 THE CANTOR SET AND THE CANTOR-LEBESGUE FUNCTION 


We have shown that a countable set has measure zero and a Borel set is Lebesgue measurable. 
These two assertions prompt the following two questions. 


Question 1 Ifa set has measure zero, is it also countable? 


Question 2 Ifa set is measurable, is it also Borel? 


The answer to each of these questions is negative. In this section we construct a set 
called the Cantor set and a function called the Cantor-Lebesgue function. By studying these 
we answer the above two questions and later provide answers to other questions regarding 
finer properties of functions. 

Consider the closed, bounded interval J = [0, 1]. The first step in the construction of 
the Cantor set is to subdivide J mto three mtervals of equal length 1/3 and remove the 
interior of the middle interval, that is, we remove the interval (1/3, 2/3) from the interval 
(0, 1] to obtain the closed set C1, which is the union of two disjoint closed intervals, each of 


length 1/3: 
C; = (0, 1/3] U [2/3, 1]. 


We now repeat this ‘open middle one-third removal” on each of the two intervals in C; to 
obtain a closed set C2, which is the union of 2” closed intervals, each of length 1/ 3: 


Cy =[0, 1/9] U [2/9, 1/3] U [2/3, 7/9] U[8/9, 1]. 


50 Chapter2 Lebesgue Measure 


We now repeat this “open middle one-third removal” on each of the four intervals in C, 
to obtain a closed set C3, which is the union of 2° closed intervals, each of length 1/37. We 
continue this removal operation countably many times to obtain the countable collection of 
sets {C,}7°,. We define the Cantor set C by 


00 
C= a Ck. 
k=1 


The collection {C;}7° , possesses the following two properties: 
(i) {Cx}?2, is a descending sequence of closed sets; 
(ii) For each k, C, is the disjoint union of 2‘ closed intervals, each of length 1/3*. 


Proposition 19 The Cantor set C is a closed, uncountable set of measure zero. 


Proof The intersection of any collection of closed sets is closed. Therefore C is closed. Each 
closed set is measurable so that each C; and C itself is measurable. 


Now each C;, is the disjoint union of 2* intervals, each of length 1/3‘, so that by the 
finite additivity of Lebesgue measure, 


m(Cx) = (2/3). 

By the monotonicity of measure, since m(C) < m(C;,) = (2/3)*, for all k, m(C) = 0. It 
remains to show that C is uncountable. To do so we argue by contradiction. Suppose C is 
countable. Let {c,}7°, be an enumeration of C. One of the two disjoint Cantor intervals 
whose union is C; fails to contain the point c;; denote it by F,. One of the two disjoint Cantor 
intervals in C2 whose union is F; fails to contain the point c2; denote it by Fy. Continuing in 
this way, we construct a countable collection of sets {Fi }72,, which, for each k, possesses the 
following three properties: (i) F; is closed and Fx41 C Fx; (ii) Fy C Cy; and (iii) cy ¢ Fy. From 
(i) and the Nested Set Theorem!” we conclude that the intersection eo Fe is nonempty. 
Let the point x belong to this intersection. By property (ii), 


io @) io @) 
(VR CI) C=C, 
k=1 k=1 


and therefore the point x belongs to C. However, {c,}?°, is an enumeration of C so that 
x = Cy for some index n. Thus c, = x € (1), Fy C Fn. This contradicts property (iii). Hence 
C must be uncountable. 


A real-valued function f that is defined on a set of real numbers is said to be 
increasing provided f(u) < f(v) whenever u < v and said to be strictly increasing, provided 
f(u) < f(v) whenever u < v. 

We now define the Cantor-Lebesgue function, a continuous, increasing function 
defined on [0, 1] which has the remarkable property that, despite the fact that g(1) > 9(0), 
its derivative exists and is zero on a set of measure 1. For each k, let O; be the union of the 
2* — 1 intervals which have been removed during the first k stages of the Cantor deletion 
process. Thus C; = [0, 1]~ Ox. Define O = U2, Ox. Then, by De Morgan’s Identities, 
C = [0, 1] ~O. We begin by defining ¢ on O and then we define it on C. 


2S ee page 19. 
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Fix a natural number k. Define ¢ on ©, to be the increasing function on O; which is 
constant on each of its 2 — 1 open intervals and takes the 2 — 1 values 


(1/2, 2/2*, 3/2", ..., [2* — 1]/2*}. 


Thus, on the single interval removed at the first stage of the deletion process, the prescription 
for ¢ is 
g(x) =1/2if xe (1/3, 2/3). 


On the three intervals that are removed in the first two stages, the prescription for ¢ is 


1/4 ifxe(1/9, 2/9) 
y(x)=4 2/4 ifxe (3/9, 6/9) = (1/3, 2/3) 
3/4 ifxe(7/9, 8/9) 


We extend ¢ to all of [0, 1] by defining it on C as follows: 
(0) = Oand g(x) =sup {e(t) | reON[0, x) bifxeC~ {0}. 


Proposition 20 The Cantor-Lebesgue function ¢ is an increasing continuous function that 
maps [0, 1] onto [0, 1]. Its derivative exists on the open set O, the complement in [0, 1] of the 
Cantor set, 


g’ = 0o0n O while m(O) = 1. 


The graph of the Cantor-Lebesgue function on 03 = [0, 1]~C3 


Proof Since ¢ is increasing on Q, its extension above to [0, 1] also is increasing. As for 
continuity, g certainly is continuous at each point in O since for each such point belongs to 
an open interval on which it is constant. Now consider a point x9 € C with xp #0, 1. Since the 
point xo belongs to C it is not a member of the 2* — 1 intervals removed in the first k stages 
of the removal process, whose union we denote by O;. Therefore, if k is sufficiently large, xo 
lies between two consecutive intervals in O;: choose a; in the lower of these and b, in the 
upper one. The function g was defined to increase by 1/2* across two consecutive intervals 
in Ox. Therefore 
ay < x9 < by and o( by) — o(ax) = 1/2*. 
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Since k may be arbitrarily large, the function ¢ fails to have a jump discontinuity at xo. For an 
increasing function, a jump discontinuity is the only possible type of discontinuity. Therefore 
¢ is continuous at xo. If xo is an endpoint of [0, 1], a similar argument establishes continuity 
at xo. 


Since ¢ is constant on each of the intervals removed at any stage of the removal 
process, its derivative exists and equals 0 at each point in O. Since C has measure zero, its 
complement in [0, 1], O, has measure 1. Finally, since (0) = 0, ¢(1) = 1 and gis increasing 
and continuous, we infer from the Intermediate Value Theorem that ¢ maps [0, 1] onto 
[0, 1]. 


Proposition 21 Let o be the Cantor- Lebesgue function and define the function on [0, 1] by 
W(x) = p(x) +x for all x €[0, 1]. 


Then jf is a strictly increasing continuous function that maps [0, 1] onto [0, 2], 


(i) maps the Cantor set C onto a measurable set of positive measure and 
(ii) maps a measurable set, a subset of the Cantor set, onto anonmeasurable set. 


Proof The function # is continuous since it is the sum of two continuous functions and 
is strictly increasing since it is the sum of an increasing and a strictly increasing function. 
Moreover, since (0) = 0 and #(1) = 2, ([0, 1]) = [0, 2]. For O = [0, 1] ~C, we have the 
disjoint decomposition 


[0,1] =CuO 
which jf lifts to the disjoint decomposition 
[0, 2] = ¥(0) Uy(C). (18) 


A strictly increasing continuous function defined on an interval has a continuous inverse. 
Therefore ¢(C) is closed and (OQ) is open, so both are measurable. We will show that 
m((O)) = 1 and therefore infer from (18) that m(#(C)) = 1 and thereby prove (i). 


Let {1,}7°, be an enumeration (in any manner) of the collection of intervals that are 
removed in the Cantor removal process. Thus O = U®,, x. Since ¢ is constant on each x, 
i maps J, onto a translated copy of itself of the same length. Since y& is one-to-one, the 
collection {( J; )}7°, is disjoint. By the countable additivity of measure, 


mW(O)) = ¥ W(t) = ¥ ele) =m(0). 
But m(C) = 0so that m(O) = 1. Therefore m((O)) = 1 and hence, by (18), m(W(C)) = 1. 
We have established (i). 


To verify (ii) we note that Vitali’s Theorem tells us that /(C) contains a set W, which 
is nonmeasurable. The set ~!( W) is measurable and has measure zero since it is a subset of 
the Cantor set. The set ¢~!(W) is a measurable subset of the Cantor set, which is mapped 
by % onto a nonmeasurable set. 
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Proposition 22 There is a measurable set, a subset of the Cantor set, that is not a Borel set. 


Proof The strictly increasing continuous. function # defined on [0, 1] that is described in 
the preceding proposition maps a measurable set A onto a nonmeasurable set. A strictly 
increasing continuous function defined on an interval maps Borel sets onto Borel sets (see 
Problem 47). Therefore the set A is not Borel since otherwise its image under ~ would be 


Borel and therefore would be measurable. L 
PROBLEMS 
34. Show that there is a continuous, strictly increasing function on the interval [0, 1] that maps a 


35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 
46. 


47. 


set of positive measure onto a set of measure zero. 


Let f be an increasing function on the open interval J. For x9 € J show that f is continuous 
at xo if and only if there are sequences {a,} and {b,,} in J such that for each n, a, < x9 < bn, 


Show that if f is any increasing function on [0, 1] that agrees with the Cantor-Lebesgue 
function g on the complement of the Cantor set, then f = ¢ on all of [0, 1]. 


Let f be a continuous function defined on E. Is it true that f—!( A) is always measurable if A 
is measurable? 


Let the function f: [a, b] > R be Lipschitz, that is, there is a constant c > 0 such that for 
all u, ve la, bj, |f(u) — f(v)| < clu — v|. Show that f maps a set of measure zero onto a 
set of measure zero. Show that f maps an F, set onto an F, set. Conclude that f maps a 
measurable set to a measurable set. 


Let F be the subset of [0, 1] constructed in the same manner as the Cantor set except that 
each of the intervals removed at the nth deletion stage has length a@3~” with 0 < a < 1. Show 
that F is a closed set, [0, 1] ~ F dense in [0, 1], and m(F) = 1-— a. Such a set F is called a 
generalized Cantor set. 


Show that there is an open set of real numbers that, contrary to intuition, has a boundary 
of positive measure. (Hint: Consider the complement of the generalized Cantor set of the 
preceding problem.) 


A nonempty subset X of R is called perfect provided it is closed and each neighborhood of 
any point in X contains infinitely many points of X. Show that the Cantor set is perfect. (Hint: 
The endpoints of all of the subintervals occurring in the Cantor construction belong to C.) 


Prove that every perfect subset X of R is uncountable. (Hint: If X is countable, construct a 
descending sequence of bounded, closed subsets of X whose intersection is empty.) 


Use the preceding two problems to provide another proof of the uncountability of the Cantor 
set. 


A subset A of R is said to be nowhere dense in R provided that for every open set © has an 
open subset that is disjoint from A. Show that the Cantor set is nowhere dense in R. 


Show that a strictly increasing function that is defined on an interval has a continuous inverse. 


Let f be a continuous function and B be a Borel set. Show that f~!(B) is a Borel set. (Hint: 
The collection of sets E for which f—!( £) is Borel is a o-algebra containing the open sets.) 


Use the preceding two problems to show that a continuous strictly increasing function that is 
defined on an interval maps Borel sets to Borel sets. 
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We devote this chapter to the study of measurable functions in order to lay the foundation 
for the study of the Lebesgue integral, which we begin in the next chapter. All continuous 
functions on a measurable domain are measurable, as are all monotone and step functions 
on a closed, bounded interval. Linear combinations of measurable functions are measurable. 
The pointwise limit of a sequence of measurable functions is measurable. We establish 
results regarding the approximation of measurable functions by simple functions and by 
continuous functions. 


3.1 SUMS, PRODUCTS, AND COMPOSITIONS 


All the functions considered in this chapter take values in the extended real numbers, that 
is, the set R U {+00}. Recall that a property is said to hold almost everywhere (abbreviated 
a.e.) on a measurable set E provided it holds on E ~ Eo, where Ep is a subset of E for which 
m( Eo) = 0. 

Given two functions h and g defined on E, for notational brevity we often write “h < g 
on £” to mean that h(x) < g(x) for all x € E. We say that a sequence of functions { f,} on E 
is increasing provided f, < f,+41 on E for each index n. 


Proposition 1 Let the function f have a measurable domain E. Then the following statements 
are equivalent: 
(i) For each real number c, the set {x € E| f(x) > c} is measurable. 
(ii) For each real number c, the set {x € E| f(x) > c} is measurable. 
(iti) For each real number c, the set {x € E| f(x) < c} is measurable. 
(iv) For each real number c, the set {x € E| f(x) < c} is measurable. 


Each of these properties implies that for each extended real number c, 
the set {xe E| f(x) =c} is measurable. 


Proof Since the sets in (i) and (iv) are complementary in E, as are the sets in (ii) and (iii), and 
the complement in E of a measurable subset of E is measurable, (i) and (iv) are equivalent, 
as are (ii) and (iii). 
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Now (i) implies (ii), since 
oO 
{xeE| f(x)>c} =(){xeE| f(x)>c—1/k}, 
k=1 


and the intersection of a countable collection of measurable sets is measurable. Similarly, 
(ii) implies (1), since 


{xe E | re) >} =U (rek | f(x) =c+1/k}, 


and the union of a countable collection of measurable sets is measurable. 


Thus statements (i)—(iv) are equivalent. Now assume one, and hence all, of them hold. 
If c is a real number, {x ¢ E| f(x) =c} = {xe E| f(x)) >c}N{xEE£| f(x) <c}, so f-'(c) 
is measurable since it is the intersection of two measurable sets. On the other hand, if c is 
infinite, say c = 00, 


fre E| F(x) =o} =() (re | f(x) >k} 


so f~!(0o) is measurable since it is the intersection of a countable collection of measurable 
sets. a 


Definition An extended real-valued function f defined on E is said to be Lebesgue measur- 
able, or simply measurable, provided its domain E is measurable and it satisfies one of the 
four statements of Proposition 1. 


Proposition 2 Let the function f be defined on a measurable set E. Then f is measurable if 
and only if for each open set O, the inverse image of O under f, f~'(O) = {x € E| f(x) €O}, 
is measurable. 


Proof If the inverse image of each open set is measurable, then since each interval (c, oo) 
is open, the function f is measurable. Conversely, suppose f is measurable. Let O be open. 
Then! we can express O as the union of a countable collection of open, bounded intervals 
{1,}7°., where each J; may be expressed as By Ax, where By, = (—00, by) and Ax = (ax, 00). 
Since f is a measurable function, each f—!(B,) and f~!(A,) are measurable sets. On the 
other hand, the measurable sets are a c-algebra and therefore f~!(©) is measurable since 


1,6) 


po)=s"| ana) =U ayartar 7 
k=l 


k=1 


The following proposition tells us that the most familiar functions from elementary 
analysis, the continuous functions, are measurable. 


Proposition 3. A real-valued function that is continuous on its measurable domain is 
measurable. 


'See page 17. 
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Proof Let the function f be continuous on the measurable set E. Let O be open. Since f 
is continuous, f-!(O) = ENU, where U is open.2 Thus f~!(O), being the intersection 
of two measurable sets, is measurable. It follows from the preceding proposition that f is 
measurable. O 


A real-valued function that is either increasing or decreasing is said to be monotone. 
We leave the proof of the next proposition as an exercise (see Problem 24). 


Proposition 4 A monotone function that is defined on an interval is measurable. 


Proposition 5 Let f be an extended real-valued function on E. 


(i) If f is measurable on E and f = g ae. on E, then g is measurable on E. 


(ti) For a measurable subset D of E, f is measurable on E if and only if the restrictions of 
f to Dand E ~ Dare measurable. 


Proof First assume f is measurable. Define A = {x € E| f(x) # g(x)}. Observe that 
{x € E| g(x) >ch={xeA| g(x) > ch Ul{rez | f (x) > ce} N[E~ Al] 


Since f = ga.e. on E, m(A) = 0. Thus {x € A| g(x) >c} is measurable since it is a subset 
of a set of measure zero. The set {x€ E| f(x) > c} is measurable since f is measurable 
on E. Since both E and A are measurable and the measurable sets are an algebra, the set 
{x € E | g(x) >c} is measurable. To verify (ii), just observe that for any c, 


{xeE| f(x)>ch={xeD| f(x)>chU{xeE~D| f(x)>c} 


and once more use the fact that the measurable sets are an algebra. LI 


The sum f + g of two measurable extended real-valued functions f and g is not 
properly defined at points at which f and g take infinite values of opposite sign. Assume f 
and g are finite a.e. on E. Define Ep to be the set of points in E at which both f and g are 
finite. If the restriction of f + g to Eo is measurable, then, by the preceding proposition, any 
extension of f + g, as an extended real-valued function, to all of E also is measurable. This 
is the sense in which we consider it unambiguous to state that the sum of two measurable 
functions that are finite a.e. is measurable. Similar remarks apply to products. The following 
proposition tells us that standard algebraic operations performed on measurable functions 
that are finite a.e. again lead to measurable functions 


Theorem 6 Let f and g be measurable functions on E that are finite a.e. on E. 


(Linearity) For any a and B, 


af + Bg is measurable on E. 


(Products) 
f gis measurable on E. 


2See page 25. 
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Proof By the above remarks, we may assume f and g are finite on all of E. If a = 0, then 
the function af also is measurable. If a #0, observe that for a number c, 


{xeE| af(x)>ch={xeE| f(x)>c/a} ifa>0 
and 

{xeE| af(x)>ch={xeE| f(x)<c/a} ifa<0. 
Thus the measurability of f implies the measurability of af. Therefore to establish linearity 
it suffices to consider the case that a = B = 1. 


For x€ E, if f(x) + (x) <c, then f(x) <c— g(x) and so, by the density of the set of 
rational numbers Q in R, there is a rational number g for which 


f(x) <q <c— g(x). 


Hence 


{xe E | f(x) + a(x) <o =U trek | g(x) <c—q}N{xeE| f(x) <q}. 
ge 


The rational numbers are countable. Thus {x € E | f(x) + g(x) <c} is measurable, since it is 
the union of a countable collection of measurable sets. Hence f + g is measurable. 


To prove that the product of measurable functions is measurable, first observe that 


, 1 
fg= l(t +g) — f?—9’]. 


Thus, since we have established linearity, to show that the product of two measurable 
functions is measurable it suffices to show that the square of a measurable function is 
measurable. For c > 0, 


(xe E| f?(x)>c}={xeEE| f(x) > Ve}U{xeE| f(x) <-Ve} 


while for c < 0, 
{xe E| f2(x) > HE. 


Thus f? is measurable. C] 


Many of the properties of functions considered in elementary analysis, including con- 
tinuity and differentiability, are preserved under the operation of composition of functions. 
However, the composition of measurable functions may not be measurable. 


Example There are two measurable real-valued functions, each defined on all of R, whose 
composition fails to be measurable. By Lemma 21 of Chapter 2, there is a continuous, strictly 
increasing function # defined on [0, 1] and a measurable subset A of [0, 1] for which (A) 
is nonmeasurable. Extend to a continuous, strictly increasing function that maps R onto 
R. The function y—! is continuous and therefore is measurable. On the other hand, A is a 
measurable set and so its characteristic function y,4 is a measurable function. We claim that 
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the composition f = v4 07! is not measurable. Indeed, if J is any open interval containing 
1 but not 0, then its inverse image under f is the nonmeasurable set (A). 


Despite the setback imposed by this example, there is the following useful proposition 
regarding the preservation of measurability under composition (also see Problem 11). 


Proposition 7 Let g be a measurable real-valued function defined on E and f a continuous 
real-valued function defined on all of R. Then the composition f 0 g is a measurable function 
on E. 


Proof According to Proposition 2, a function is measurable if and only if the inverse image 
of each open set is measurable. Let O be open. Then 


(fog) '(O)=8'(f-'(0)). 


Since f is continuous and defined on an open set, the set / = f~!(O) is open.* We infer 
from the measurability of the function g that g~'(U/) is measurable. Thus the inverse image 
(f og)~!(O) is measurable and so the composite function f o g is measurable. L] 


An immediate important consequence of the above composition result is that if f is 
measurable with domain E, then | f| is measurable, and indeed 


| |? is measurable with the same domain E for each p > 0. 
For a finite family { f,}7_, of functions with common domain E, the function 
max{fi,..., fn} 
is defined on E by 
max{fi,..., fn}(x) = max{f,(x),..., fr(x)} for xe E. 
The function min{f;,..., f,} is defined the same way. 


Proposition 8 For a finite family { f;},_, of measurable functions with common domain E, 
the functions max{fi,..., fn} and min{f,,..., f,} also are measurable. 


Proof For any c, we have 
{xeE| max{fi,..., fal(x)>ch=U {xekE| fi(x)>c} 
k=1 


so this set is measurable since it is the finite union of measurable sets. Thus the function 
max{fi,..., f:}18s measurable. A similar argument shows that the function min{f,,..., fn} 
also is measurable. LI 


3See page 25. 
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For a function f defined on E, we have the associated functions | f|, ft, and f~ defined 


on E by 


Ifl(x) = max{f(x),—f(x)}, f"(x) = max{f(x), 0}, f-(x) = max{—f(x), 0}. 


If f is measurable on E, then, by the preceding proposition, so are the functions |f|, f*, 
and f~. This will be important when we study integration since the expression of f as the 
difference of two nonnegative functions, 


f=ft—fonE, 


plays an important part in defining the Lebesgue integral. 


10. 


11. 


PROBLEMS 


. Suppose f and g are continuous functions on [a, b]. Show that if f = g a.e. on [a, b], then, in 


fact, f = g on [a, b]. Is a similar assertion true if [a, b] is replaced by a general measurable 
set E? 


. Let D and E be measurable sets and f a function with domain DU E. We proved that f 1s 


measurable on DU E if and only if its restrictions to D and E are measurable. Is the same 
true if “measurable” is replaced by ‘“‘continuous’’? 


. Suppose a function f has a measurable domain and is continuous except at a finite number 


of points. Is f necessarily measurable? 


. Suppose f is a real-valued function on R such that f~!(c) is measurable for each number c. 


Is f necessarily measurable? 


. Suppose the function f is defined on a measurable set E and has the property that 


{x € E| f(x) >c}1s measurable for each rational number c. Is f necessarily measurable? 


. Let f be a function with measurable domain D. Show that f is measurable if and only if the 


function g defined on R by g(x) = f(x) for xe D and g(x) = 0 for x ¢ D is measurable. 


. Let the function f be defined on a measurable set E. Show that f is measurable if and only 


if for each Borel set A, f~!(A) is measurable. (Hint: The collection of sets A that have the 
property that f—!(A) is measurable is a o-algebra.) 


. (Borel measurability) A function f is said to be Borel measurable provided its domain E is a 


Borel set and for each c, the set {x € E| f(x) >c} is a Borel set. Verify that Proposition 1 and 
Theorem 6 remain valid if we replace “(Lebesgue) measurable set’’ by ““Borel set.”” Show 
that: (i) every Borel measurable function is Lebesgue measurable; (it) if f is Borel measurable 
and B is a Borel set, then f~!(B) is a Borel set; (iii) if f and g are Borel measurable, so is 
f og; and (iv) if f is Borel measurable and g is Lebesgue measurable, then f o g is Lebesgue 
measurable. 


. Let {f,} be a sequence of measurable functions defined on a measurable set E. Define Eo to 


be the set of points x in E at which { f,,(x)} converges. Is the set Ey measurable? 


Suppose f and g are real-valued functions defined on all of R, f is measurable, and ¢g 1s 
continuous. Is the composition f o g necessarily measurable? 


Let f be a measurable function and g be a one-to-one function from R onto R which has a 
Lipschitz inverse. Show that the composition f o g is measurable. (Hint: Examine Problem 
38 in Chapter 2.) 
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3.2 SEQUENTIAL POINTWISE LIMITS AND SIMPLE APPROXIMATION 


For a sequence {f,,} of functions with common domain E and a function f on E, there are 
several distinct ways in which it is necessary to consider what it means to state that 


“the sequence { f,} converges to f.” 


In this chapter we consider the concepts of pointwise convergence and uniform convergence, 
which are familiar from elementary analysis. In later chapters we consider many other modes 
of convergence for a sequence of functions. 


Definition For a sequence { f,} of functions with common domain E, a function f on E and 
a subset A of E, we say that 


(i) The sequence { f,} converges to f pointwise on A provided - 


hn fn(x) = f(x) forallxe A. 
(i) The sequence {f,} converges to f pointwise a.e. on A provided it converges to f 
pointwise on A~ B, where m(B) = 0. 
(ii) The sequence { f,} converges to f uniformly on A provided for each € > 0, there is an 
index N for which 


lf — ful<eonA foralln>N. 


When considering sequences of functions {f,} and their convergence to a function 
f, we often implicitly assume that all of the functions have a common domain. We write 
{fn} > f pointwise on A”’ to indicate the sequence { f,,} converges to f pointwise on A and 
use similar notation for uniform convergence. 

The pointwise limit of continuous functions may not be continuous. The pointwise 
limit of Riemann integrable functions may not be Riemann integrable. The following 
proposition is the first indication that the measureable functions have much better stability 
properties. 


Proposition9 Let { f,} be a sequence of measurable functions on E that converges pointwise 
a.e.on E to the function f. Then f is measurable. 


Proof Let Ep be a subset of E for which m{ Ey) = 0 and { f,} converges to f pointwise on 
E ~ Eo. Since m( Eo) = 0, it follows from Proposition 5 that f is measurable if and only if its 
restriction to E ~ Ep is measurable. Therefore, by possibly replacing E by E~ Ep, we may 
assume the sequence converges pointwise on all of E. 


Fix a number c. We must show that {x € E| f(x) <c} is measurable. Observe that for 
a point x € E, since limy -, 6 fn(x) = f(x), 


f(x)<e 
if and only if 


there are natural numbers n and k for which fj(x) <c—1/n for all j > k. 
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But for any natural numbers n and j, since the function f; is measurable, the set 
{xe€ E| f;(x) <c—1/n} is measurable. Therefore, for any k, the intersection of the countably 
collection of measureable sets 


OO 


(\{xeE| fi(x)<c—1/n} 


jak 


also is measurable. Consequently, since the union of a countable collection of measurable 
sets is measurable, 


{xeE | f(x)<c}= (J Auer fi(x) <c—-1/n} 


1<k,n<oo | j=k 


is measurable. C] 


If A is any set, the characteristic function of A, x4, is the function on R defined by 


(x) 1 ifxeA 
XAT) 0 ifxé A. 


It is clear that the function y,4 is measurable if and only if the set A is measurable. Thus 
the existence of a nonmeasurable set implies the existence of a nonmeasurable function. 
Linear combinations of characteristic functions of measurable sets play a role in Lebesgue 
integration similar to that played by step functions in Riemann integration, and so we name 
these functions. 


Definition A real-valued function o defined on a measurable set E is called simple provided 
it is measurable and takes only a finite number of values. 


We emphasize that a simple function only takes real values. Linear combinations and 
products of simple functions are simple since each of them takes on only a finite number of 
values. If g is simple, has domain E and takes the distinct values c1,..., C,, then 


n 
p= > ce- Xx, ONE, where Ey = {xeEE | p(x) =c}. 
k=1 


This particular expression of ¢ as a linear combination of characteristic functions is called 
the canonical representation of the simple function ¢. 


The Simple Approximation Lemma Let f be a measurable real-valued function on E. 
Assume f is bounded on E, that is, there is an M > 0 for which |f| < Mon E. Then for 
each « > 0, there are simple functions », and Ws, defined on E which have the following 
approximation properties: 


ge <f <P and0<e— Oe <EeonE. 
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Proof Let (c, d) be an open, bounded interval that contains the image of E, f(E), and 
C= YS <---<Yn-1 <n =a 


be a partition of the closed, bounded interval [c, d] such that y, — y,_1 <¢€ for1 <k <n. 
Defne 


I = [ye-1, ye) and Ey = f7'(y) for 1 <k <n. 


Since each J; is an interval and the function f is measurable, each set E;, is measurable. 
Define the simple functions ¢, and #, on E by 


Pe = 


n n 

Yk—1° XE, and We = > Yk XE,: 

k=1 k=1 

Let x belong to E. Since f(E) C (c,d), there is a unique k,1 < k <n, for which 
Ye-1 < f(x) < yy and therefore 


Pe(x) = ye-1 S f(x)<a = (x). 
But yx — yx_-1 <€, and therefore y, and #, have the required approximation properties. [ 


To the several characterizations of measurable functions that we already established, 
we add the following one. 


The Simple Approximation Theorem An extended real-valued function f on a measurable 
set E is measurable if and only if there is a sequence {g,} of simple functions on E which 
converges pointwise on E to f and has the property that 


lon| < |fl on E for all n. 


If f is nonnegative, we may choose {,} to be increasing. 


Proof Since each simple function is measurable, Proposition 9 tells us that a function is 
measurable if it is the pointwise limit of a sequence of simple functions. It remains to prove 
the converse. 


Assume f is measurable. We also assume f > 0 on E. The general case follows 
by expressing f as the difference of nonnegative measurable functions (see Problem 23). 
Let n be a natural number. Define E, = {xe E| f(x) < n.} Then E, is a measurable 
set and the restriction of f to E, is a nonnegative bounded measurable function. By the 
Simple Approximation Lemma, applied to the restriction of f to E, and with the choice of 
€ = 1/n, we may select simple functions ¢, and y,, defined on E, which have the following 
approximation properties: 


0<o, < f <q on E, and 0 < ff — @, <1/non Ey. 


Observe that 
O< on < f and0 < f—@, < Ym — Gn <1/non Ep. (1) 
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Extend , to all of E by setting y,(x) =n if f(x) >n. The function ¢, is a simple function 
defined on E and 0 < , < f on E. We claim that the sequence {y,} converges to f pointwise 
on E. Let x belong to E. 


Case 1: Assume f(x) is finite. Choose a natural number WN for which f(x) < N. Then 
0 < f(x) — g(x) <1/nforn > N, 


and therefore lim, 00 Wn(x) = f(x). 
Case 2: Assume f(x) =.00. Then g(x) =n for all n, so that limy 90 Gn(x) = f(x). 


By replacing each ¢, with max{g),..., Qn} we have {g,} increasing. LJ 


PROBLEMS 


12. Let f be a bounded measurable function on E. Show that there are sequences of simple 
functions on E, {gn} and {if,}, such that {g,} is increasing and {,} is decreasing and each of 
these sequences converges to f uniformly on E. 


13. A real-valued measurable function is said to be semisimple provided it takes only a countable 
number of values. Let f be any measurable function on E. Show that there 1s a sequence of 
semisimple functions { f,} on E that converges to f uniformly on E. 


14. Let f be a measurable function on E that is finite a.e.on E and m( E) < oo. For each e > 0, 
show that there is a measurable set F contained in E such that f is bounded on F and 
m(E~F) <e. 


15. Let f be a measurable function on E that is finite a.e. on E and m( E) <oo. Show that for each 
« > 0, there is a measurable set F contained in E and a sequence {¢,,} of simple functions on 
E such that {y,}—> f uniformly on F and m(E ~ F) < ¢. (Hint: See the preceding problem.) 


16. Let I be a closed, bounded interval and E a measurable subset of J. Let « > 0. Show that 
there is a step function h on J and a measurable subset F of J for which 


h= ye 0n F andm(I~F) <e. 


(Hint: Use Theorem 12 of Chapter 2.) 
17. Let J be aclosed, bounded interval and w a simple function defined on J. Let € > 0. Show that 
there is a step function h on J and a measurable subset F of J for which 
h = Won F and m(I~ F) <e. 
(Hint: Use the fact that a simple function is a linear combination of characteristic functions 
and the preceding problem.) 
18. Let I be a closed, bounded interval and f a bounded measurable function defined on J. Let 


« > 0. Show that there is a step function h on J and a measurable subset F of J for which 


|h — f|<«on F andm(I~F) <e. 


19. Show that the sum and product of two simple functions are simple as are the max and 
the min. 
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20. Let A and B be any sets. Show that 


XANB = XA -XB 
XAUB = XA + XB—XA-XB 
Xac =1—ya. 


21. For a sequence { f,} of measurable functions with common domain E, show that each of the 
following functions is measurable: 


inf {f,}, Sup {f,}, liming {f,} and lim sup {f,}. 


22. (Dini’s Theorem) Let {f,} be an increasing sequence of continuous functions on [a, 5] 
which converges pointwise on [a, b] to the continuous function f on [a, b]. Show that the 
convergence is uniform on [a, b]. (Hint: Let « > 0. For each natural number n, define 
E, = {xeé[a, b]| f(x) — fr(x) < €}. (Show that {E,,} is an open cover of [a, b] and use the 
Heine-Borel Theorem.) 

23. Express a measurable function as the difference of nonnegative measurable functions and 
thereby prove the general Simple Approximation Theorem based on the special case of a 
nonnegative measurable function. 


24. Let I be an interval and f: J R be increasing. Show that f is measurable by first showing 
that, for each natural number n, the strictly increasing function x-> f(x) +x/n is measurable, 
and then taking pointwise limits. 


3.3 LITTLEWOOD’S THREE PRINCIPLES, EGOROFF’S THEOREM, 
AND LUSIN’S THEOREM 


Speaking of the theory of functions of a real variable, J. E. Littlewood says,’ “The extent 
of knowledge required is nothing like so great as is sometimes supposed. There are three 
principles, roughly expressible in the following terms: Every [measurable] set is nearly a 
finite union of intervals; every [measurable] function is nearly continuous; every pointwise 
convergent sequence of [measurable] functions is nearly uniformly convergent. Most of the 
results of [the theory] are fairly intuitive applications of these ideas, and the student armed 
with them should be equal to most occasions when real variable theory is called for. If one 
of the principles would be the obvious means to settle the problem if it were ‘quite’ true, it 
is natural to ask if the ‘nearly’ is near enough, and for a problem that is actually solvable it 
generally is.” 

Theorem 12 of Chapter 2 is one precise formulation of Littlewood’s first principle: It 
tells us that given a measurable set E of finite measure, then for each € > 0, there is a finite 
disjoint collection of open intervals whose union U/ is “nearly equal to” E in the sense that 
m(E~U)+m(U~E) <e. 

A precise realization of the last of Littlewood’s principle is the following surprising 
theorem. 


Egoroff's Theorem Assume E has finite measure. Let {f,} be a sequence of measurable 
functions on E that converges pointwise on E to the real-valued function f. Then for each 
€ > 0, there is a closed set F contained in E for which 


{fn} — f uniformly on F and m(E~ F) <e. 
41 ittlewood [Lit41], page 23. 
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To prove Egoroff’s Theorem it is convenient to first establish the following lemma. 


Lemma 10 Under the assumptions of Egoroff’s Theorem, for each n > 0 and 5 > 0, there is a 
measurable subset A of E and an index N for which 


lfn — f|<nonA for alln > N and m(E~ A) <6. 


Proof For each k, the function | f — f| is properly defined, since f is real-valued, and it is 
measurable, so that the set {x € E|| f(x) — f;(x)| <n} is measurable. The intersection of a 
countable collection of measurable sets is measurable. Therefore 


E, = {xe E| | f(x) — fe(x)| < for all k > n} 


is a measurable set. Then {E,}°, is an ascending collection of measurable sets, and 
E = U% En, since {f,} converges pointwise to f on E. We infer from the continuity of 
measure that 

m(E)= lim m(En ). 
Since m(E) < oo, we may choose an index N for which m( Ey ) > m(E) — «. Define A = E, 
and observe that, by the excision property of measure, m(E~A) = m(E)—m(Ey)<e. U 


Proof of Egoroff’s Theorem For each natural number n, let A, be a measurable subset 
of E and N(n) an index which satisfy the conclusion of the preceding lemma with 6 = 
¢/2"+! and n = 1/n, that is, 


m(E~A,) <«/2"*! (2) 
and | 
l fi — f| <1/n on A, for all k > N(n). (3) 
Define 
0, @] 
A=() An. 
=1 


By De Morgan’s Identities, the countably subadditivity of measure and (2), 


m(E~A)= n(Ce~ au < Ss m(E~A,)< Ss e/2"t! = €/2. 
n=1 n=1 n=1 


We claim that { f,} converges to f uniformly on A. Indeed, let « > 0. Choose an index ng 
such that 1/no < €. Then, by (3), 


| fi — f| <1/no on A,, for k > N(no). 
However, A C A,, and 1/no < € and therefore 
lfc — f| <¢on A fork > N(no). 


Thus { f,} converges to f uniformly on A and m( E~ A) <é/2. 


Finally, by Theorem 11 of Chapter 2, we may choose a closed set F contained in A for 
which m(A~ F) <«/2. Thus m(E~ F) <e and {f,}— f uniformly on F. CO 
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It is clear that Egoroff’s Theorem also holds if the convergence is pointwise a.e. and 
the limit function is finite a.e. 

We now present a precise version of Littlewood’s second principle in the case the 
measurable function is simple and then use this special case to prove the general case of the 
principle, Lusin’s Theorem. 


Proposition 11 Let f be a simple function defined on E. Then for each « > 0, there is a 
continuous function g on R and a closed set F contained in E for which 


f =gonFandm(E~F)<e. 


Proof Let a), a2,...,a, be the finite number of distinct values taken by f, and let them 
be taken on the sets EF), Fo,..., En, respectively. The collection {Ex};_, 1s disjoint since 
the a,’s are distinct. According to Theorem 11 of Chapter 2, we may choose closed sets 
F\, Fo,..., F, such that for each index k, 1 <k <n, 


Fi, C EX and m( Ex ~ Fx) <e/n. 


Then F = Uy, Fx, being the union of a finite collection of closed sets, is closed. Since 
{Ex};_1 18 disjoint, 


n 


m(E~F) -™(U ~A] = \ m(Ex~ Fi) <e. 
k=1 


k=1 


Define g on F to take the value a, on F;, for 1 < k <n. Since the collection {Fi}p_, is 
disjoint, g is properly defined. Moreover, g is continuous on F since for a point x € F;, there 
is an open interval containing x which is disjoint from the closed set U.«; F, and hence on 
the intersection of this interval with F the function g is constant. But g can be extended 
from a continuous function on the closed set F to a continuous function on all of R (see 
Problem 25). The continuous function g on R has the required approximation properties. C1 


Lusin’s Theorem Let f be a real-valued measurable function on E. Then for each « >0, there 
is a continuous function g on R and a closed set F contained in E for which 


f =gon Fandm(E~F) <e. 


Proof We consider the case that m(E) < oo and leave the extension to m(E) = oo as 
an exercise. According to the Simple Approximation Theorem, there is a sequence {f,} 
of simple functions defined on E that converges to f pointwise on E. Let n be a natural 
number. By the preceding proposition, with f replaced by f, and « replaced by ¢/2"+!, we 
may choose a continuous function g, on R and a closed set F, contained in E for which 


fn = 8m On F, and m(E~ F,) <¢/2"*!. 


According to Egoroff’s Theorem, there is a closed set Fy contained in E such that { fa} 
converges to f uniformly on Fo and m(E~ Fo) < «/2. Define F = Oreo Fn. Observe that, 
by De Morgan’s Identities and the countable subadditivity of measure, 
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m(E~ F) = m(te~myuUte~ <6/2+ S ¢/2nt! = ¢. 


n=1 n=1 


The set F is closed since it is the intersection of closed sets. Each f, is continuous on F 
since F C F, and f, = gn on Fy. Finally, {f,} converges to f uniformly on F since F C Fo. 
However, the uniform limit of continuous functions is continuous, so the restriction of f to 
F is continuous on F. Finally, there is a continuous function g defined on all of R whose 
restriction to F equals f (see Problem 25). This function g has the required approximation 
properties. 


25. 


26. 


27. 


28. 


29. 
30. 


31. 


PROBLEMS 


Suppose f is a function that is continuous on a closed set F of real numbers. Show that 
f has a continuous extension to all of R. This is a special case of the forthcoming Tietze 
Extension Theorem. (Hint: Express R ~ F as the union of a countable disjoint collection of 
open intervals and define f to be linear on the closure of each of these intervals.) 


For the function f and the set F in the statement of Lusin’s Theorem, show that the restriction 
of f to F is a continuous function. Must there be any points at which f, considered as a 
function on E, is continuous? 


Show that the conclusion of Egoroff’s Theorem can fail if we drop the assumption that the 
domain has finite measure. 


Show that Egoroff’s Theorem continues to hold if the convergence is pointwise a.e. and f is 
finite a.e. 


Prove the extension of Lusin’s Theorem to the case that EF has infinite measure. 


Prove the extension of Lusin’s Theorem to the case that f is not necessarily real-valued, but 
may be finite a.e. 


Let {f,} be a sequence of measurable functions on E that converges to the real-valued f 
pointwise on E. Show that E = U7?) Ex, where for each index k, E, is measurable, and { f,} 
converges uniformly to f on each E;, ifk > 1, and m( E;) = 0. 
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We now turn to our main object of interest in Part I, the Lebesgue integral. We define this 
integral in four stages. We first define the integral for simple functions over a set of finite 
measure. Then for bounded measurable functions f over a set of finite measure, in terms of 
integrals of upper and lower approximations of f by simple functions. We define the integral 
of a general nonnegative measurable function f over E to be the supremum of the integrals 
of lower approximations of f by bounded measurable functions that vanish outside a set of 
finite measure; the integral of such a function is nonnegative, but may be infinite. Finally, 
a general measurable function is said to be integrable over E provided [ rif| < oo. We 
prove that linear combinations of integrable functions are integrable and that, on the class 
of integrable functions, the Lebesgue integral is a monotone, linear functional. A principal 
virtue of the Lebesgue integral, beyond the extent of the class of integrable functions, is 
the availability of quite general criteria which guarantee that if a sequence of integrable 
functions { f,} converge pointwise almost everywhere on E to f, then 


tim, [a= f Lui, sl |r 


We refer to that as passage of the limit under the integral sign. Based on Egoroff’s 
Theorem, a consequence of the countable additivity of Lebesgue measure, we prove 
four theorems that provide criteria for justification of this passage: the Bounded Convergence 
Theorem, the Monotone Convergence Theorem, the Lebesgue Dominated Convergence 
Theorem, and the Vitali Convergence Theorem. 


4.1 THE RIEMANN INTEGRAL 


We recall a few definitions pertaining to the Riemann integral. Let f be a bounded real- 
valued function defined on the closed, bounded interval [a, b]. Let P = {xo, x1,..., Xn} bea 
partition of [a, b], that is, 

a=xX)<x<...<x, =D. 
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Define the lower and upper Darboux sums for f with respect to P, respectively, by | : 
LF, P) = Sym (a1) 
and ; 
. U(f, P)= Mi (4+), 
where,! for 1 <i <n, | 
m; = inf { f(x) | xj-1<x <x} and Mj = sup {f(x)| xj-1<x<-xi}. 


We then define the lower and upper Riemann integrals of f over [a, b], respectively, by 
pb | 
(R) | f =sup {L(f, P)| Pa partition of [a, b]} 
“va 


and 
pb 
(R) | f =inf {U(f, P)| Pa partition of [a, b]}. 


Since f is assumed to be bounded and the interval [a, b] has finite length, the lower and 
upper Riemann integrals are finite. The upper integral is always at least as large as the lower 
integral, and if the two are equal we say that f is Riemann integrable over [a, b]” and call 
this common value the Riemann integral of f over [a, b]. We denote it by 


| b 

(R) ff 
to temporarily distinguish it it from the Lebesgue integral, which we consider i in the next 
section. 


A real-valued function ys defined on [a, b] is called a step function provided there is a 
partition P = {xq, x1,..., Xn} of [a, b] and numbers cj, ..., c, such that for 1 <i <n, 


W(x) = c if xi-4 <x < x. 


Observe that ; 
L(p, P) = > ci(x; — x31) = U(y, P). 
i=l 


‘Tf we define 
= inf { f(x) | Xj-1 <x < xj} and M; = sup { f(x) | xj-1 <x <x}, 
so the infima and suprema are taken over closed subimtervals, we arrive at the same value of the upper and lower 
Riemann integral. 
2An elegant theorem of Henri Lebesgue, Theorem 8 of Chapter 5, tells us that a necessary and sufficient 


condition for a bounded function f to be Riemann integrable over [a, b] is that the set of points m [a, b] at which 
f fails to be continuous has Lebesgue measure zero. 
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From this and the definition of the upper and lower Riemann integrals, we infer that a step 
function ys is Riemann integrable and | 


b n | 
(R) | p= 2 ile — Xj-1). 


Therefore, we may reformulate the definition of the lower and upper Riemann integrals as 


follows: 
b b 
iw) [ss fs 


nf sine [+ 


Example (Dirichlet’s Function) Define f on [0, 1] by setting f(x) = 1 if x is rational and 
0 if x is irrational. Let P be any partition of [0, 1]. By the density of the rationals and the 
irrationals,° 


g astep function and gy < f on fa, oh 


and 


w a step function and > f on [a, oh. 


L(f, P) =OandU(f, P) =1. 
Thus 


@)f s=o<1=(m)fs 


so f is not Riemann integrable. The set of rational numbers in [0,1] is countable.’ Let Crayamt 
be an enumeration of the rational numbers in [0, 1]. For a natural number n, define fn On 
[0, 1] by setting f,(x) = 1, if x = q for some qx With 1 <k <n, and f(x) = 0 otherwise. 
Then each f,, is a step function, so it is Riemann integrable. Thus, { fn} 1S an increasing 
sequence of Riemann integrable functions on [0, 1], | 


[fn| < 10n [0, 1] for all n 
and . | 
{fn} > f pointwise on [0, 1]. 


However, the limit function f fails to be Riemann integrable on [0, 1]. 


PROBLEMS 
1. Show that, in the above Dirichlet function example, { f,} fails to converge to f uniformly on 
[0, 1]. | 


2. A partition P’ of [a, D] is called a refinement of a partition P provided each partition point 
of P is also a partition point of P’. For a bounded function f on [a, b], show that under 
refinement lower Darboux sums increase and upper Darboux sums decrease. — 


3See page 12. 
4See page 14. 
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3. Use the preceding problem to show that for a bounded function on a closed, bounded interval, 
each lower Darboux sum is no greater than each upper Darboux sum. From this conclude 
that the lower Riemann integral is no greater than the upper Riemann integral. 


4. Suppose the bounded function f on [a, b] is Riemann integrable over [a, b]. Show that there 
is a sequence {P,} of partitions of [a, b] for which limy-; 00 [U( f, Pn) —L(f, Pn)| =0. 

5. Let f be a bounded function on [a, b]. Suppose there is a sequence {P,} of partitions of 
[a, b] for which lim, _, ..[U(f, Pn) — L( f, Pn)] = 0. Show that f is Riemann integrable 
over [a, b]. 

6. Use the preceding problem to show that since a continuous function f on a closed, bounded 
interval [a, b] is uniformly continuous on [a, b], it is Riemann integrable over [a, b]. 

7. Let f be an increasing real-valued function on [0, 1]. For a natural number n, define P, to 
be the partition of [0, 1] into n subintervals of length 1/n. Show that U( f, P,) -—L(f, Pn) < 
1/n[f(1) — f(0)]. Use Problem 5 to show that f is Riemann integrable over [0, 1]. 


8. Let {f,} be a sequence of bounded functions that converges uniformly to f on the closed, 
bounded interval [a, b]. If each f,, is Riemann integrable over [a, b], show that f also is 
Riemann integrable over [a, b]. Is it true that 


Jin, [n= fr 


4.2 THE LEBESGUE INTEGRAL OF A BOUNDED MEASURABLE FUNCTION 
OVER A SET OF FINITE MEASURE 


The Dirichlet function, which was examined in the preceding section, exhibits one of the 
principal shortcomings of the Riemann integral: a uniformly bounded sequence of Riemann 
integrable functions on a closed, bounded interval can converge pointwise to a function that 
is not Riemann integrable. We will see that the Lebesgue integral does not suffer from this 
shortcoming. 

Henceforth we only consider the Lebesgue integral, unless explicitly mentioned oth- 
erwise, and so we use the pure integral symbol to denote the Lebesgue integral. The 
forthcoming Theorem 3 tells us that any bounded function that is Riemann integrable over 
[a, b] is also Lebesgue integrable over [a, b] and the two integrals are equal. 

Recall that a measurable real-valued function w defined on a set E is said to be simple 
provided it takes only a finite number of real values. If takes the distinct values a1,..., ay 
on E, then, by the measurability of , its level sets #~!(a;) are measurable and we have the 
canonical representation of # on E as 


y= Sa; - XE, on E, where each E; = w!(a;) = {xe E| p(x) =a;}. (1) 
ix1 


The canonical representation is characterized by the E;’s being disjoint and the a,’s being 
distinct. 


Definition For a simple function defined on a set of finite measure E, we define the integral 
of w over E by 


[v= a -m( Ej), 
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where has the canonical representation given by (1). 


Lemma 1 Let {E;}"_, be a finite disjoint collection of measurable subsets of a set of finite 
measure E. For 1 <i <n, let a; be a real number. 


n 
Ife = > 4;- xe, on E, then | 


n 
Q= > 4i -m( Ej). 


Proof The collection {E£;}?_, is disjoint but the above may not be the canonical representation 
since the a;’s may not be distinct. We must account for possible repetitions. Let {A;,..., Am} 
be the distinct values taken by g. For 1 < j < m, set Aj = {x € E| p(x) = Aj}. By definition 
of the integral in terms of canonical representations, 


m 
p= SY Aj-m(A;). 
E =] 
For 1 < j < m, let J; be the set of indices i in {1,...,n} for which a; = A;. Then 
{1,....n}= Ui I;, and the union is disjoint. Moreover, by finite additivity of measure, 


m(A;) = S m(£;) for all 1 < j < m. 


Le 1; 


Therefore 


i= j=l rel; J= iél; 
m 
= Srj-m(Aj)= | 9. q 
j=l VE 


One of our goals is to establish linearity and monotonicity properties for the general 
Lebesgue integral. The following is the first result in this direction. 


Proposition 2 (Linearity and Monotonicity of Integration) Let ¢ and be simple functions 
defined on a set of finite measure E. Then for any a and B, 


[(ao+ pny =af orev 


ife <wWoneE, then | os | ¥. 
E E 


Moreover, 


Proof Since both ¢ and w take only a finite number of values on E, we may choose a finite 
disjoint collection {E;}?_, of measurable subsets of EZ, the union of which is E, such that 
and w are constant on each £;. For eachi, 1 <i <n, let a; and b;, respectively, be the values 
taken by ¢ and # on E;. By the preceding lemma, 


= Sa;-m(E;) and | v= Sb;-m(E;) 
E i=l E i=1 
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However, the simple function ag + Bw takes the constant value aa; + Bb; on E;. Thus, again 
by the preceding lemma, 


[ (ae + an = ¥\ (aa; + b;) -m(E:) 


i=] 
= aS) aj-m(E) +8 > b-m(E:)=af o+B | ¥. 
i=] i=1 E E 


To prove monotonicity, assume g < W on E. Define n = W — g on E. By linearity, 


[v-[e=[w-e)=[nz0 


since the nonnegative simple function 7 has a nonnegative integral. L 


The linearity of integration over sets of finite measure of simple functions shows 
that the restriction in the statement of Lemma 1 that the collection {E;}?_, be disjoint is 
unnecessary. 

A step function takes only a finite number of values and each interval is measurable. 
Thus a step function is simple. Since the measure of a singleton set is zero and the measure 
of an interval is its length, we infer from the linearity of Lebesgue integration for simple 
functions defined on sets of finite measure that the Riemann integral over a closed, bounded 
interval of a step function agrees with the Lebesgue integral. 

Let f be a bounded real-valued function defined on a set of finite measure FE. By 
analogy with the Riemann integral, we define the lower and upper Lebesgue integral, 
respectively, of f over E to be 


sup | °| psinpe and < fn 
E 


and 
ot | ‘| ysinpleandf <40n | 
E 


Since f is assumed to be bounded, by the monotonicity property of the integral for simple 
functions, the lower and upper integrals are finite and the upper integral is always at least as 
large as the lower integral. 


Definition A bounded function f on a domain E of finite measure is said to be Lebesgue 
integrable over E provided its upper and lower Lebesgue integrals over E are equal. The 
common value of the upper and lower integrals is called the Lebesgue integral, or simply the 
integral, of f over E and is denoted by J, f. 


Theorem 3 Let f be a bounded function defined on the closed, bounded interval [a, b|. If f is 
Riemann integrable over [a, b], then it is Lebesgue integrable over [a, b| and the two integrals 
are equal. 
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Proof The assertion that f is Riemann integrable means that, setting J = [a, b], 


sup o fe | g a step function, g < i = inf G Jy | wastep function, f < ’ 


To prove that f is Lebesgue integrable we must show that 


ww {fe psinpe,e =f} =int{ [| ying, =v} 
I I 


However, each step function is a simple function and, as we have already observed, for 
a step function, the Riemann integral and the Lebesgue integral are the same. Therefore 
the first equality implies the second and also the equality of the Riemann and Lebesgue 
integrals. LJ 


We are now fully justified in using the symbol /,, f, without any preliminary (R), to 
denote the integral of a bounded function that is Lebesgue integrable over a set of finite 


measure. In the case of an interval E = [a, b], we sometimes use the familiar notation f f 
to denote Sia b] f and sometimes it is useful to use the classic Leibniz notation f f(x) dx. 


Example The set E of rational numbers in [0, 1] is a measurable set of measure zero. The 
Dirichlet function f is the restriction to [0, 1] of the characteristic function of E, yz. Thus 
f is integrable over [0, 1] and 


f= 1-yg=1-m(E)=0. 
[0, 1] [0, 1] 


We have shown that f is not Riemann integrable over [0, 1]. 


Theorem 4 Let f be a bounded measurable function on a set of finite measure E. Then f is 
integrable over E. 


Proof Let n be a natural number. By the Simple Approximation Lemma, with ¢ = 1/n, 
there are two simple functions ¢,, and yw, defined on E for which 


Qn <f <w, on E, 


and 
0< Wn — On <1/nonE. 


By the monotonicity and linearity of the integral for simple functions, 


0< | de- | = | ibm —on) < 1/n-m(E). 
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However, 


osint| fo vsimpl. y= ssp} | oxime < / 
E E 

_— n <1/n-m(E). 
<|¥ [¢ <1/n-m(E) 


This inequality holds for every natural number n and m(E) is finite. Therefore the upper 
and lower Lebesgue integrals are equal and thus the function f is integrable over E. LI 


It turns out that the converse of the preceding theorem is true; a bounded function on 
a set of finite measure is Lebesgue integrable if and only if it is measurable: we prove this 
later (see the forthcoming Theorem 7 of Chapter 5). This shows, in particular, that not every 
bounded function defined on a set of finite measure is Lebesgue integrable. In fact, for any 
measurable set E of finite positive measure, the restriction to E of the characteristic function 
of each nonmeasurable subset of E fails to be Lebesgue integrable over E. 


Theorem 5 (Linearity and Monotonicity of Integration) Let f and g be bounded measurable 
functions on a set of finite measure E. Then for any a and B, 


(af+pg)=a} ft+B] zg. (2) 
| [r+e], 


if f<gonE, nen J fs] (3) 


Moreover, 


Proof A linear combination of measurable bounded functions is measurable and bounded. 
Thus, by Theorem 4, af + Bg is integrable over E. We first prove linearity for B = 0. If is 
a simple function so is aw, and conversely (if a#0). We established linearity of integration 
for simple functions. Let a > 0. Since the Lebesgue integral is equal to the upper Lebesgue 


integral, 
— inf _ ; 
Jef vat, |v a iat J! Wal= aff 


For a < 0, since the Lebesgue integral is equal both to the upper Lebesgue integral and the 
lower Lebesgue integral, 


[jas ~ jar de® _" lolalet E [e/a = “|, I 


It remains to establish linearity in the case that a = B = 1. Let w; and > be simple functions 
for which f < W and g < W on E. Then wy + ys is a simple function and f+ g < % +2 on 
E. Hence, since {,.( f + g) is equal to the upper Lebesgue integral of f + g over E, by the 
linearity of integration for simple functions, 


[(r+es [in+m)= fons [ve 
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The greatest lower bound for the sums of integrals on the right-hand side, as y and W vary 
among simple functions for which f < y and g < W, equals /,, f + [,,g. These inequalities 
tell us that {-( f + g) is a lower bound for these same sums. Therefore, 


[rtesf rtf 


It remains to prove this inequality in the opposite direction. Let g; and 2 be simple functions 
for which g; < f and g < gon E. Then g, + ¢2 < f+gon E and gj + ¢ is simple. Hence, 
since [-( f +) is equal to the lower Lebesgue integral of f + g over E, by the linearity of 
integration for simple functions, 


[(r+e)= [io+en= fot fo 


The least upper bound bound for the sums of integrals on the right-hand side, as ¢; and 
$2 vary among simple functions for which g; < f and g < g, equals [ rf t+ f 78: These 
inequalities tell us that f,,( f + g) is an upper bound for these same sums. Therefore, 


[rsa [r+ fe 


This completes the proof of linearity of integration. 


To prove monotonicity, assume f < gon E. Define h = g — f on E. By linearity, 


fe-[r-fte-n=]n 


The function h is nonnegative and therefore y < h on E, where y=0 on E. Since the integral 
of h equals its lower integral, [,,h = f, = 0. Therefore, {, f < f,8. O 


Corollary 6 Let f be a bounded measurable function on a set of finite measure E. Suppose A 
and B are disjoint measurable subsets of E. Then 


[ fafrefe (4) 


Proof Both f - y4 and f - yg are bounded measurable functions on E. Since A and B are 
disjoint, 

f+ Xaup =f -Xat fxs. 
Furthermore, for any measurable subset E; of E (see Problem 10), 


[t= [tx 


Therefore, by the linearity of integration, 


[f= f fexne= [text [pexe= fore fir - 
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Corollary 7 Let f be a bounded measurable function on a set of finite measure E. Then 


I < [in (5) 


Proof The function | f| is measurable and bounded. Now 


-Ifls f <l|flon E. 


By the linearity and monotonicity of integration, 


-firisfrsfin 


that is, (5) holds. a 


Proposition 8 Let {f,} be a sequence of bounded measurable functions on a set of finite 
measure E. 


If {fn} — f uniformly on E, then lim [m= fr 
n—-> CO E E 


Proof Since the convergence is uniform and each f, is bounded, the limit function f is 
bounded. The function f is measurable since it is the pointwise limit of a sequence of 
measurable functions. Let € > 0. Choose an index N for which 


lf -— fal < €/m(E) on E for all n > N. (6) 


By the linearity and monotonicity of integration and the preceding corollary, for eachn > N, 


[t- t [ir- fl 


Therefore limn— oo fr fn = Saf. O 


< [ir- fu < [e/m(E)]-m(E) =e. 


This proposition is rather weak since frequently a sequence will be presented that 
converges pointwise but not uniformly. It is important to understand when it is possible to 
infer from 

{fn} > f pointwise a.e. on E 


{onl= flint] [1 


We refer to this equality as passage of the limit under the integral sign.” Before proving our 
first important result regarding this passage, we present an instructive example. 


that 


lim 
n—-> & 


>This phrase is taken from I. P. Natanson’s Theory of Functions of a Real Variable [Nat55]. 
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Example For each natural number n, define f,, on [0, 1] to have the value 0if x > 2/n, have 
f(1/n) =n, f(0) = 0 and to be linear on the intervals [0, 1/n] and [1/n, 2/n]. Observe that 
fh fn = 1 for each n. Define f =0 on [0, 1]. Then 


1 1 
{fnr}—> f pointwise on [0, 1], but lim. i fn ® | f. 
n- 0 0 


Thus, pointwise convergence alone is not sufficient to justify passage of the limit under the 
integral sign. 


The Bounded Convergence Theorem Let { f,,} be a sequence of measurable functions on a 
set of finite measure E. Suppose { f,} is uniformly pointwise bounded on E, that is, there is a 
number M > 0 for which 

[fn] < Mon E forall n. 


If {fn}—> f pointwise on E, then lim [n= 
n> eo E E 


Proof The proof of this theorem furnishes a nice illustration of Littlewood’s Third Principle. 
If the convergence is uniform, we have the easy proof of the preceding proposition. However, 
Egoroff’s Theorem tells us, roughly, that pointwise convergence is “nearly” uniform. 


The pointwise limit of a sequence of measurable functions is measurable. Therefore f 
is measurable. Clearly | f| < M on E. Let A be any measurable subset of E and n a natural 
number. By the linearity and additivity over domains of the integral, 


[e- [r= fte-n=fte-n+ fo nef cn. 


Therefore, by Corollary 7 and the monotonicity of integration, 


[n-fr 


To prove convergence of the integrals, let e>0. Since m( E)<ooand f isreal-valued, Egoroff’s 
Theorem tells us that there is a measurable subset A of E for which { f,}—> f uniformly on A 
and m( E~ A) < €/4M. By uniform convergence, there is an index N for which 


< [if f1+2M-m(E~A) (7) 


€ 
1- f{|<—-_ Af N. 
lfn — fl 2 m(E) on A for all n > 


Therefore, for n > N, we infer from (7) and the monotonicity of integration that 


[o-[o<scm 


Hence the sequence of integrals {/,, f,} converges to / rd: LI 


-m(A)+2M-m(E~A) <e. 


Remark Prior to the proof of the Bounded Convergence Theorem, no use was made of the 
countable additivity of Lebesgue measure on the real line. Only finite additivity was used, and 
it was used just once, in the proof of Lemma 1. But for the proof of the Bounded Convergence 
Theorem we used Egoroff’s Theorem. The proof of Egoroff’s Theorem needed the continuity 
of Lebesgue measure, a consequence of countable additivity of Lebesgue measure. 
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PROBLEMS 


9. Let E have measure zero. Show that if f is a bounded function on E, then f is measurable 
and J, f =0. 


10. Let f be a bounded measurable function on a set of finite measure E. For a measurable 
subset A of E, show that f, f =f, f-Xa- 


11. Does the Bounded Convergence Theorem hold for the Riemann integral? 


12. Let f be a bounded measurable function on a set of finite measure E. Assume g is bounded 
and f = ga. on E. Show that J, f = fy g. 


13. Does the Bounded Convergence Theorem hold if m(E) < oo but we drop the assumption 
that the sequence {| f,,|} is uniformly bounded on E? 


14. Show that Proposition 8 is a special case of the Bounded Convergence Theorem. 
15. Verify the assertions in the last Remark of this section. 


16. Let f be a nonnegative bounded measurable function on a set of finite measure E. Assume 
Se f =0. Show that f = 0a.e. on E. 


4.3 THE LEBESGUE INTEGRAL OF A MEASURABLE 
NONNEGATIVE FUNCTION 


A measurable function f on E is said to vanish outside a set of finite measure provided there 
is a Subset Eo of E for which m( Ey) < oo and f =0 on E~ Ep. It is convenient to say that a 
function that vanishes outside a set of finite measure has finite support and define its support 
to be {x € E| f(x) #0}.° In the preceding section, we defined the integral of a bounded 
measurable function f over a set of finite measure E. However, even if m(E) = on, if f is 
bounded and measurable on E but has finite support, we can define its integral over E by 


[s=[r 


where Eo has finite measure and f =0 on E ~ Ep. This integral is properly defined, that is, it 
is independent of the choice of set of finite measure Ep outside of which f vanishes. This is a 
consequence of the additivity over domains property of integration for bounded measurable 
functions over a set of finite measure. 


Definition For f a nonnegative measurable function on E, we define the integral of f over 
E by’ \ 


| f =sup | h | h bounded, measurable, of finite support and 0 <h < f on eh (8) 
E E 


6But care is needed here. In the study of continuous real-valued functions on a topological space, the support of 
a function is defined to be the closure of the set of points at which the function is nonzero. 

TThis is a definition of the integral of a nonnegative extended real-valued measurable function; it is not a 
definition of what it means for such a function to be integrable. The integral is defined regardless of whether the 
function is bounded or the domain has finite measure. Of course, the integral is nonnegative since it is defined to 
be the supremum of a set of nonnegative numbers. But the integral may be equal to 00, as it is, for instance, for 
a nonnegative measurable function that takes a positive constant value of a subset of E of infinite measure or the 
value oo on a subset of E of positive measure. 
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Chebychev’s Inequality Let f be a nonnegative measurable function on E. Then for any 
A> 0, 


mixeE| fix)zass-f s (9) 
E 
Proof Define E, = {x € E| f(x) > A}. First suppose m(E,) = oo. Letn be a natural number. 


Define Ey, , = E, N[—n, n] and wi, = A- x E,,--Lhen w, is a bounded measurable function 
of finite support, 


Am(Exn) = | th and 0 < w, < f on E for all n. 
E 


We infer from the continuity of measure that 


oo =A-m(E))=A- lim m(E),) = lim fos|r 
n> 0O n> 00 Jr E 


Thus inequality (9) holds since both sides equal oo. Now consider the case m(E)) < ov. 
Define h = d- yz,. Then h is a bounded measurable function of finite support and 0 < h < f 
on E. By the definition of the integral of f over E, 


A-m(E)) =| < | f. 
E E 
Divide both sides of this inequality by A to obtain Chebychev’s Inequality. C 


Proposition 9 Let f be a nonnegative measurable function on E. Then 
| f = 0ifand only if f =0ae.on E. (10) 
E 


Proof First assume {, f = 0. Then, by Chebychev’s Inequality, for each natural num- 
ber n, m{xeX| f(x) = 1/n} = 0. By the countable additivity of Lebesgue measure, 
m{x€X| f(x) > 0} = 0. Conversely, suppose f = 0a.e.on E. Let ¢ be a simple function 
and h a bounded measurable function of finite support for which 0 < 9 <h < f on E. Then 
g = 0a.e. on E and hence f,¢ = 0. Since this holds for all such g, we infer that { ph =0. 
Since this holds for all such h, we infer that f, f = 0. O 


Theorem 10 (Linearity and Monotonicity of Integration) Let f and g be nonnegative 
measurable functions on E. Then for any a > 0 and B > 0, 


[tar+be)=0f r+6f (11) 


Moreover, 


if f <gonE, tren | f=] 6 (12) 
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Proof Fora>0,0<h < f on Eif and only if0 < ah < af on E. Therefore, by the linearity 
of the integral of bounded functions of finite support, {, af = a f,, f. Thus, to prove linearity 
we need only consider the case a = B = 1. Let h and g be bounded measurable functions of 
finite support for which 0 < h < f andO <k < gon E. We have0<h+k< f+gonE, 
and h + k also is a bounded measurable function of finite support. Thus, by the linearity of 
integration for bounded measurable functions of finite support, 


[n+ [a= [tes [iree 


The least upper bound for the sums of integrals on the left-hand side, as h and k vary 
among bounded measurable functions of finite support for which h < f andk < g, equals 
f gst t+ f £8: Lhese inequalities tell us that f e(f +) is an upper bound for these same sums. 


Therefore, 
[s+fes [ (7 +2). 
E E JE 


It remains to prove this inequality in the opposite direction, that is, 


reas] rtfe 


By the definition of /,(f +g) as the supremum of /,¢ as @ ranges over all bounded 
measurable functions of finite support for which 0 < é < f+g on E, to verify this inequality 
it is necessary and sufficient to show that for any such function @, 


fes[r+fe (13) 


For such a function @, define the functions h and k on E by 
h=min{f, 2}andk =£—honE. 


Let x belong to E. If £(x) < f(x), then k(x) = 0 < g(x); if €(x) > f(x), then h(x) = 
(x) — f(x) < g(x). Therefore, h < g on E. Both h and k are bounded measurable functions 
of finite support. We have 


0<h<f,0<k<gand@=h+konE. 


Hence, again using the linearity of integration for bounded measurable functions of finite 
support and the definitions of /, f and J, g, we have 


fr=[nsfes[rrfe 


Thus (13) holds and the proof of linearity is complete. 


In view of the definition of [ ¢ J as a supremum, to prove the monotonicity inequality 
(12) it is necessary and sufficient to show that if h is a bounded measurable function of finite 
support for which 0 <h < f on E, then 


fuse (14) 
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Let h be such a function. Then h < g on E. Therefore, by the definition of f,g as a 
supremum, fh < {,,g. This completes the proof of monotonicity. O 


Theorem 11 (Additivity Over Domains of Integration) Let f be a nonnegative measurable 
function on E. If A and B are disjoint measurable subsets of E, then 


ies Ra 


In particular, if Eo is a subset of E of measure zero, then 
fr=[ + (15) 
JE E~ Ep 


Proof Additivity over domains of integration follows from linearity as it did for bounded 
functions on sets of finite measure. The excision formula (15) follows from additivity over 
domains and the observation that, by Proposition 9, the integral of a nonnegative function 
over a set of measure zero is zero. L] 


The following lemma will enable us to establish several criteria to justify passage of the 
limit under the integral sign. 


Fatou’s Lemma Let { f,,} be a sequence of nonnegative measurable functions on E. 
If fr} f pointwise a.e.on E, then [ f< mint Sn: (16) 
JE E 


Proof In view of (15), by possibly excising from E a set of measure zero, we assume the 
pointwise convergence is on all of E. The function f is nonnegative and measurable since 
it is the pointwise limit of a sequence of such functions. To verify the inequality in (16) it 
is necessary and sufficient to show that if h is any bounded measurable function of finite 
support for which 0 < h < f on E, then 


In < lim inf [ Sn: (17) 
JE JE 
Let h be such a function. Choose M > 0 for which |h| < M on E. Define Ep = {x € E| h(x)#0}. 
Then m( Eo) < oo. Let n be a natural number. Define a function h, on E by 
h, = min{h, f,} on E. 
Observe that the function h, is measurable, that 
0 <h, < M on Ep and h, =0 0n E~ Ep. 


Furthermore, for each x in E, since h(x) < f(x) and {f,(x)} > f(x), {An(x)} > h(x). We 
infer from the Bounded Convergence Theorem applied to the uniformly bounded sequence 
of restrictions of h,, to the set of finite measure Eg, and the vanishing of each h, on E ~ Eo, that 


lim | h, = lim n= | h= | h 
n—> co E n—> oo Eo Eo E 
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However, for each n, hy, < f, on E and therefore, by the definition of the integral of f,, over 
E, frhn < fp fn. Thus, 


[r= lim hn timing f fn: LJ 
E N>OSF E 


The inequality in Fatou’s Lemma may be strict. 


Example Let E = (0, 1] and for a natural number n, define f, =n - (0, 1/n). Then {fn} 
converges pointwise on E to f =0 on E. However, 


| #=0<1- lim [fi 
E N>O JSF 


As another example of strict inequality in Fatou’s Lemma, let E = R and for a natural 
number n, define gn = X(n,n+1)- Then {g,} converges pointwise on E to g=0 on E. However, 


| e=0<1- lim | &n- 
E n—> oo E 


However, the inequality in Fatou’s Lemma is an equality if the sequence {f,,} is 
increasing. 


The Monotone Convergence Theorem Let {f,,} be an increasing sequence of nonnegative 
measurable functions on E. 


If {fn} > f pointwise a.e.on E, then im, | fn =| f. 
n— E E 


Proof According to Fatou’s Lemma, 


ff stimint ff, 


However, for each index n, f, < f a.e. on E, and so, by the monotonicity of integration for 
nonnegative measurable functions and (15), [,. fn < J, f. Therefore 


imsup ff < ff 

imsup | f < a 

[f= im, | te O 
E n— 0oO 


Corollary 12 Let {u,,} be a sequence of nonnegative measurable functions on E. 


Hence 


0O 0O 
If f = > un pointwise a.e.on E, then | f=> | un. 
E “JE 


n=l 


Proof Apply the Monotone Convergence Theorem with f, = d7_, ux, for each index n, 
and then use the linearity of integration for nonnegative measurable functions. LJ 
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Definition A nonnegative measurable function f ona measurable set E is said to be integrable 
over E provided 
| f <oo. 
E 


Proposition 13 Let the nonnegative function f be integrable over E. Then f is finite a.e.on E. 


Proof Let n be a natural number. Chebychev’s Inequality and the monotonicity of measure 
tell us that 


1 
m{xe E| f(x) =oo}<m{xeE| fayempsi | s 
But J, f is finite and therefore m{x € E| f(x) = co} = 0. Ol 


Beppo Levi’s Lemma Let { f,,} be an increasing sequence of nonnegative measurable functions 
on E. If the sequence of integrals {{,. fn} is bounded, then { f,} converges pointwise on E to a 
measurable function f that is finite a.e.on E and 


n—> Oo 


lim fa= | <0 


Proof Every monotone sequence of extended real numbers converges to an extended real 
number.® Since { f,} is an increasing sequence of extended real-valued functions on E, we 
may define the extended real-valued nonnegative function f pointwise on E by 


f(x)= hm fn(x) for all x € E. 


According to the Monotone Convergence Theorem, { f zJn}—> J, f. Therefore, since the 
sequence of real numbers {/,, f,} is bounded, its limit is finite and so f,, f < oo. We infer 
from the preceding proposition that f is finite a.e.on E. LI 


PROBLEMS 
17. Let E be a set of measure zero and define f = 00 on E. Show that f,. f = 0. 
18. Show that the integral of a bounded measurable function of finite support is properly defined. 


19. For anumber a, define f(x) = x* for 0 <x <1, and f(0) =0. Compute h f 


20. Let { fn} be a sequence of nonnegative measurable functions that converges to f pointwise on 
E. Let M > 0 be such that Jf, f, < M for alln. Show that {, f < M. Verify that this property 
is equivalent to the statement of Fatou’s Lemma. 


21. Let the function f be nonnegative and integrable over E and « > 0. Show there is a simple 
function 7 on E that has finite support, 0 < » < f on E and f,.|f — | <e. If E is a closed, 
bounded interval, show there is a step function h on E that has finite support and /,, | f —h|<e. 


22. Let {fn} be a sequence of nonnegative measurable functions on R that converges pointwise 
on R to f and f be integrable over R. Show that 


if | f= lim | fn, then [ f=, f= lm im, | fn for any measurable set E. 
R 7 OVE 


8See page 23. 
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23. Let {a,} be a sequence of nonnegative real numbers. Define the function f on E = [1, oo) by 
setting f(x) =a, ifn <x <n+1. Show that f, f = D°0, an. 


24. Let f be a nonnegative measurable function on E. 


(i) Show there is an increasing sequence {¢,} of nonnegative simple functions on E, each of 
finite support, which converges pointwise on E to f. 


(ii) Show that /,, f = sup {/,,¢|¢ simple, of finite support and 0 < ¢ <, f on £}. 


25. Let {fn} be a sequence of nonnegative measurable functions on E that converges pointwise 
on E to f. Suppose f, < f on E for each n. Show that 


sim, f fi= [if 


26. Show that the Monotone Convergence Theorem may not hold for decreasing sequences of 
functions. 


27. Prove the following generalization of Fatou’s Lemma: If {f,} 1s a sequence of nonnegative 
measurable functions on E, then 


E E 


4.4 THE GENERAL LEBESGUE INTEGRAL 
For an extended real-valued function f on E, we have defined the positive part f* and the 
negative part f~ of f, respectively, by 
f* (x) = max{ f(x), 0} and f~ (x) = max{— f(x), 0} for all x € E. 
Then f* and f~ are nonnegative functions on E, 
f=ft-f oE 

and 

Ifl=fT +f on. 
Observe that f is measurable if and only if both f* and f~ are measurable. 


Proposition 14 Let f be a measurable function on E. Then ftand f~ are integrable over E 
if and only if | f| is integrable over E. 


Proof Assume f* and f~ are integrable nonnegative functions. By the linearity of integra- 
tion for nonnegative functions, |f| = ft + f~ is integrable over E. Conversely, suppose 
| f| is integrable over E. Since 0 < ft < |f| and 0 < f~ < |f| on E, we infer from the 
monotonicity of integration for nonnegative functions that both f* and f~ are integrable 
over E. LI 


Definition A measurable function f on E is said to be integrable over E provided |f\ is 
integrable over E. When this is so we define the integral of f over E by 


[-hr-hr 
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Of course, for a nonnegative function f, since f = f+ and f~ =0 on E, this definition 
of integral coincides with the one just considered. By the linearity of integration for bounded 
measurable functions of finite support, the above definition of integral also agrees with the 
definition of integral for this class of functions. 


Proposition 15 Let f be integrable over E. Then f is finite a.e.on E and 
[r=] f if Ey) CE and m( Ey) = 0. (18) 
E E~ Ep 


Proof Proposition 13, tells us that | f| is finite a.e.on E. Thus f is finite a.e.on E. Moreover, 
(18) follows by applying (15) to the positive and negative parts of f. a 


The following criterion for integrability is the Lebesgue integral correspondent of the 
comparison test for the convergence of series of real numbers. 


Proposition 16 (the Integral Comparison Test) Let f be a measurable function on E. 
Suppose there is a nonnegative function g that is integrable over E and dominates f in the 
sense that 

Ifl< gon E. 


foals fin 


Proof By the monotonicity of integration for nonnegative functions, | f|, and hence f, is 
integrable. By the triangle inequality for real numbers and the linearity of integration for 


[ir 


We have arrived at our final stage of generality for the Lebesgue integral for functions 
of a single real variable. Before proving the linearity property for integration, we need to 
address, with respect to integration, a point already addressed with respect-to measurability. 
The point is that for two functions f and g which are integrable over E, the sum f + g is not 
properly defined at points in E where f and g take infinite values of opposite sign. However, 
by Proposition 15, if we define A to be the set of points in E at which both f and g are finite, 
then m( E~ A) = 0. Once we show that f + g is integrable over A, we define 


[ir+ey= [rte 


We infer from (18) that f,.( f +g) is equal to the integral over E of any extension of (f+g)|4 
to an extended real-valued function on all of E. 


Then f is integrable over E and 
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Theorem 17 (Linearity and Monotonicity of Integration) Let the functions f and g be 
integrable over E. Then for any a and B, the function af + Bg is integrable over E and 


[tat+ee)=af fre ¢ 


if f < gon E, then [rss 
JE 


E 


Moreover, 


Proof If a > 0, then [af]t = aft and [af]- = af—, while if a <0, [af]* = -—af- 
and [af]” = —af*. Therefore f,af = af, f, since we established this for nonnegative 
functions f and a > 0. So it suffices to establish linearity in the case a = B = 1. By 
the linearity of integration for nonnegative functions, | f| + |g| is integrable over E. Since 
If + gl < |f| +g] on E, by the integral comparison test, f + g also is integrable over E. 
Proposition 15 tells us that f and g are finite a.e.on E. According to the same proposition, 
by possibly excising from E a set of measure zero, we may assume that f and g are finite on 


E. To verify linearity is to show that 
[r-[r}+|[e-Le], (19) 
E JE E E 


(ft+s)"-(ft+s) =f+e=(ft-f-)+(gt—g°) o£, 


and therefore, since each of these six functions takes real values on E, 


[tr+at - [trs+er = 


But 


(ftel t+fote =(ft+e) + ft+et o£. 


We infer from linearity of integration for nonnegative functions that 


[(r+ey + [ir +f =| (f+) +] f+ [e. 


Since f, g and f + g are integrable over E, each of these six integrals is finite. Rearrange 
these integrals to obtain (19). This completes the proof of linearity. 


To establish monotonicity we again argue as above that we may assume g and f are 
finite on E. Define h = g — f on E. Then h is a properly defined nonnegative measurable 
function on E. By linearity of integration for integrable functions and monotonicity of 
integration for nonnegative functions, 


fs -[ r=] (g- f)= | neo O 


Corollary 18 (Additivity Over Domains of Tntegration) Let f be integrable over E. Assume 
A and B are disjoint measurable subsets of E. Then 


Lobel 
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Proof Observe that | f - v4] < | f| and |f - va| < |f| on E. By the integral comparison test, 
the measurable functions f - y4 and f - yg are integrable over E. Since A and B are disjoint 


f+XauB=f-xXAt+f-Xp ONE. (21) 


But for any measurable subset C of E (see Problem 28), 


[t= [fx 


Thus (20) follows from (21) and the linearity of integration. a 


The following generalization of the Bounded Convergence Theorem provides another 
justification for passage of the limit under the integral sign. 


The Lebesgue Dominated Convergence Theorem Let { f,} be a sequence of measurable 
functions on E. Suppose there is a function g that is integrable over E and dominates { f,} on 
E in the sense that | f,| < g on E forall n. 


If {fn} > f pointwise a.e.on E, then f is integrable over E and im. | fa = | f. 
7 OVE E 


Proof Since | f,| < g on E and |f| < ga.e.on E and g is integrable over E, by the integral 
comparison test, f and each f, also are integrable over E. We infer from Proposition 15 
that, by possibly excising from E a countable collection of sets of measure zero and using the 
countable additivity of Lebesgue measure, we may assume that f and each f, is finite on E. 
The function g — f and for each n, the function g — f,, are properly defined, nonnegative 
and measurable. Moreover, the sequence {g — f,,} converges pointwise a.e.on E to g — f. 
Fatou’s Lemma tells us that 


[ (e— ) stimint | (¢— fy). 
E JE 


Thus, by the linearity of integration for integrable functions, 


[sf r= [(e- 1) timing [ (e- f.) =f e—timsup ff 


that is, 
lim sup | f ns | f . 
E E 
Similarly, considering the sequence {g + f,}, we obtain 
[ stimint [ f. 
E E 


The proof is complete. LI 
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The following generalization of the Lebesgue Dominated Convergence Theorem, the 
proof of which we leave as an exercise (see Problem 32), is often useful (see Problem 33). 


Theorem 19 (General Lebesgue Dominated Convergence Theorem) Let { f,} be a sequence 
of measurable functions on E that converges pointwise a.e.on E to f. Suppose there is a 
sequence {g,} of nonnegative measurable functions on E that converges pointwise a.e.on E to 
g and dominates { f,,} on E in the sense that 


lfal < gn on E forall n. 


if lim n= | ¢<c0, then tim | f=] F 
n—> oo E E n— co E E 


Remark Jn Fatou’s Lemma and the Lebesgue Dominated Convergence Theorem, the 
assumption of pointwise convergence a.e.on E rather than on all of E is not a decoration 
pinned on to honor generality. It is necessary for future applications of these results. We 
provide one illustration of this necessity. Suppose f is an increasing function on all of R. A 
forthcoming theorem of Lebesgue (Lebesgue’s Theorem of Chapter 6) tells us that 


fim Lt Ma) — F(2) _ 


n— oo 1/n 


f'(x) for almost all x. (22) 


From this and Fatou’s Lemma we will show that for any closed, bounded interval [a, }}, 


b 
| f'(x)dx < f(b) — f(a). 


In general, given a nondegenerate closed, bounded interval [a, b] and a subset A of [a, b] that 
has measure zero, there is an increasing function f on [a, b] for which the limit in (22) fails to 
exist at each point in A (see Problem 10 of Chapter 6). 


PROBLEMS 
28. Let f be integrable over E and C a measurable subset of E. Show that {. f = J, f -xc- 


29. For a measurable function f on[1, 00) which is bounded on bounded sets, define a, = {” + f 
for each natural number n. Is it true that f is integrable over [1, 00) if and ofily if the series 
Ln—1 4n converges? Is it true that f is integrable over [1, 00) if and only if the series 5°, ay 
converges absolutely? 


30. Let g be a nonnegative integrable function over E and suppose {f,} is a sequence of 
measurable functions on E such that for each n, | f,| < g a.e.on E. Show that 


J timing fy < timing f fn <limsup | fu < | timsup fr 
E E E E 


31. Let f be a measurable function on E which can be expressed as f = g +h on E, where g is 
finite and integrable over E and h is nonnegative on E. Define f,, f = {, 9+ J, 4. Show that 
this is properly defined in the sense that it is independent of the particular choice of finite 
integrable function g and nonnegative function h whose sum is f. 
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32. Prove the General Lebesgue Dominated Convergence Theorem by following the proof of 
the Lebesgue Dominated Convergence Theorem, but replacing the sequences {g — f,} and 
{g + fn}, respectively, by {gn — fn} and {gn + fn}. 

33. Let {f,} be a sequence of integrable functions on E for which f, > f a.e.on E and f is 
integrable over E. Show that J,,|f — fn|—> 0 if and only if lim, 0 fe | fal = Jy l fl. (Hint: 
Use the General Lebesgue Dominated Convergence Theorem.) 


34. Let f be a nonnegative measurable function on R. Show that 


jim f r= r 


35. Let f be a real-valued function of two variables (x, y) that is defined on the square 
Q = {(x, y)|0 < x < 1,0 < y < 1} and is a measurable function of x for each fixed value 
of y. Suppose for each fixed value of x, limy_,9 f(x, y) = f(x) and that for all y, we have 
[f(x, y)| < g(x), where g is integrable over [0, 1]. Show that 


1 1 
Jim, | f(x, y)dx = | f(x)dx. 


Also show that if the function f(x, y) is continuous in y for each x, then 


J 
h(y) -/ f(x, y)dx 


is a continuous function of y. 


36. Let f be a real-valued function of two variables (x, y) that is defined on the square 
Q = {(x, y)|0 <x < 1,0 < y < 1} and is a measurable function of x for each fixed value of 
y. For each (x, y) € Q let the partial derivative 0 f/dy exist. Suppose there is a function g that 
is integrable over [0, 1] and such that 


oF, y) 


< g(x) for all (x, y) €Q. 
dy 


Prove that 


1 1 
all f(x, A =| (a y) dx for all y€[0, 1]. 


4.5 COUNTABLE ADDITIVITY AND CONTINUITY OF INTEGRATION 


The linearity and monotonicity properties of the Lebesgue integral, which we established 
in the preceding section, are extensions of familiar properties of the Riemann integral. In 
this brief section we establish two properties of the Lebesgue integral which have no coun- 
terpart for the Riemann integral. The following countable additivity property for Lebesgue 
integration is a companion of the countable additivity property for Lebesgue measure. 


Theorem 20 (the Countable Additivity of Integration) Let f be integrable over E and 
{E,}P°., a disjoint countable collection of measurable subsets of E whose union is E. Then 


[r=>] + (23) 
E n=1" En 
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Proof Let n be a natural number. Define f, = f - x, where y, is the characteristic function 
of the measurable set Ur, Ex. Then f, is a measurable function on E and 


lfnl < [fl on E. 


Observe that {f,}—> f pointwise on E. Thus, by the Lebesgue Dominated Convergence 


Theorem, 
| f= lm | fh. 
E n> C&O E 


On the other hand, since {£,}°C, is disjoint, it follows from the additivity over domains 


property of the integral that for each n, 


n 
| tn = > f. 
E k=1 Ex 
Thus 
| n oO 
f= tim, | fa = lim f\= f. 
E n>© E " n> © 2 E; 2 E, UO 
We leave it to the reader to use the countable additivity of integration to prove 
the following result regarding the continuity of integration: use as a pattern the proof of 
continuity of measure based on countable additivity of measure. 
Theorem 21 (the Continuity of Integration) Let f be integrable over E. 


(i) If {E,}°, is an ascending countable collection of measurable subsets of E, then 


| f= lim [| ¢ (24) 
Ux, En n->©O En 
(ti) If {E,}°°, is a descending countable collection of measurable subsets of E, then 
] f= lim | f. (25) 
Ne, En n-> C&O E, 
PROBLEMS 


37. Let f be a integrable function on E. Show that for each ¢ > 0, there is a natural number N 
for which if n > N, then Me, f| < «where E, = {x€ E| |x| > n}. 


38. For each of the two functions f on [1, oo) defined below, show that lim, -, 0 f i f exists while 
f is not integrable over [1, 00). Does this contradict the continuity of integration? 


(i) Define f(x) =(-1)"/n, forn <x<n+1. 
(ii) Define f(x) = (sinx)/x for 1 <x < oo. 


39. Prove the theorem regarding the continuity of integration. 
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4.6 UNIFORM INTEGRABILITY: THE VITALI CONVERGENCE THEOREM 


We conclude this first chapter on Lebesgue integration by establishing, for functions that are 
integrable over a set of finite measure, a criterion for justifying passage of the limit under 
the integral sign which is suggested by the following lemma and proposition. 


Lemma 22 Let E be a set of finite measure and 6 > 0. Then E is the disjoint union of a finite 
collection of sets, each of which has measure less than 6. 


Proof By the continuity of measure, 
lim m(E~[-—n, n]) = m(@) =0. 
n> oo 


Choose a natural number no for which m( E ~[—no, no]) < 6. By choosing a fine enough 
partition of [—no, no], express EN[—no, no] as the disjoint union of a finite collection of sets, 
each of which has measure less than 6. CJ 


Proposition 23 Let f be a measurable function on E. If f is integrable over E, then for each 
€ > 0, there isa 6 > 0 for which 


if A C E is measurable and m(A) < 6, then | lfl<e. (26) 
A 


Conversely, in the case m( E) < oo, if for each € > 0, there is a § > 0 for which (26) holds, then 
f is integrable over E. 


Proof The theorem follows by establishing it separately for the positive and negative parts 
of f. We therefore suppose f > 0 on E. First assume f is integrable over E. Let « > 0. 
By the definition of the integral of a nonnegative integrable function, there is a measurable 
bounded function f, of finite support for which 


0<f<fonzandd< | f- ff <e/2 


Since f — f. > 0 on E, if A C E is measurable, then, by the linearity and additivity over 
domains of the integral, 


J t-] t= [tr-ns[tr-s=f r-[e<er 


But f. is bounded. Choose M > 0 for which 0 < fe <M on Ep. Therefore, if A C E is 
measurable, then 


[ t<] srgrsm-ma)+e2 


Define 6 = ¢/2M. Then (26) holds for this choice of 5. Conversely, suppose m( E) < oo and 
for each € > 0, there is a 6 > 0 for which (26) holds. Let 59 > 0 respond to the € = 1 challenge. 
Since m( E) < oo, according to the preceding lemma, we may express E as the disjoint union 
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of a finite collection of measurable subsets {Ex}, each of which has measure less than 6. 
Therefore 


N 
S| FK<N. 
k= VE 


By the additivity over domains of integration it follows that if h is a nonnegative measurable 
function of finite support and 0 <h < f on E, then [ ph <N. Therefore f is integrable. UO 


Definition A family F of measurable functions on E is said to be uniformly integrable 
over’ E provided for each € > 0, there is a > 0 such that foreach f ef, 


if A C E is measurable and m( A) < 6, then | lfl<e. (27) 
A 


Example Let g be a nonnegative integrable function over E. Define 
F ={f | f ismeasurable on E and|f| < gon E}. 


Then ¥ is uniformly integrable. This follows from Proposition 23, with f replaced by g, and 
the observation that for any measurable subset A of E, by the monotonicity of integration, 


if f belongs to F, then 
fuisfs 
A A 


Proposition 24 Let { f,}?_, be a finite collection of functions, each of which is integrable over 
E. Then { fi},_, 8 uniformly integrable. 


Proof Let « > 0. For 1 <k <n, by Proposition 23, there is a 6; > 0 for which 
if A C EF is measurable and m( A) < 6;, then | fil <e. (28) 
A 


Define 5 = min{6, ..., 5,}. This 6 responds to the ¢ challenge regarding the criterion for the 
collection { f,}/_, to be uniformly integrable. O 


Proposition 25 Assume E has finite measure. Let the sequence of functions { fy} be uniformly 
integrable over E. If {fn} —> f pointwise a.e.on E, then f is integrable over E. 


Proof Let 5p > 0 respond to the € = 1 challenge in the uniform integrability criteria for the 
sequence {f,,}. Since m( E) < oo, by Lemma 22, we may express E as the disjoint union of a 
finite collection of measurable subsets {E;}/_, such that m(E,) < 5p for 1 < k < N. For any 
n, by the monotonicity and additivity over domains property of the integral, 


N 
[m= Ifnl <N. 
E k=1 4 Ex 


9 What is here called “uniformly integrable” is sometimes called “equiintegrable.” 
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We infer from Fatou’s Lemma that 


[isi stimint [fai <1. 


Thus | f| is integrable over E. LJ 


The Vitali Convergence Theorem Let E be of finite measure. Suppose the sequence of 
functions { f,} is uniformly integrable over E. 


If {fn} —> f pointwise a.e. on E, then f is integrable over Eand lim [ fn = | f. 
NOSE E 


Proof Propositions 25 tells us that f is integrable over E and hence, by Proposition 15, is 
finite a.e. on E. Therefore, using Proposition 15 once more, by possibly excising from E a set 
of measure zero, we suppose the convergence is pointwise on all of E and f is real-valued. 
We infer from the integral comparison test and the linearity, monotonicity, and additivity 
over domains property of integration that, for any measurable subset A of E and any natural 


number n, 
[n-fr [in-n) 
<[im-s 
=[ it si+ fia 


sf itm si+f imi+ fin 


Let « > 0. By the uniform integrability of {f,}, there isa > 0 such that J, | fn| < ¢/3 for any 
measurable subset of E for which m(A) < 6. Therefore, by Fatou’s Lemma, we also have 
J, \f| < €/3 for any measurable subset of A for which m(A) < 6. Since f is real-valued and 
E has finite measure, Egoroff’s Theorem tells us that there is a measurable subset Eo of E 
for which m( Ey) < 6 and {f,,} > f uniformly on E ~ Ep. Choose a natural number N such 
that | f, — f| <«/[3-m(E)] on E~ Ep for all n > N. Take A = Ep in the integral inequality 


[n-fr 
<¢/[3-m(E)]-m(E~ Ep) +€/3 + €/3 <€. 


(29) 


This completes the proof. O 
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The following theorem shows that the concept of uniform integrability is an essential 
ingredient in the justification, for a sequence {h,} of nonnegative functions on a set of finite 
measure that converges pointwise to h =0, of passage of the limit under the integral sign. 


Theorem 26 Let E be of finite measure. Suppose {h,} is a sequence of nonnegative integrable 
functions that converges pointwise a.e.on E to h=0. Then 


lim. h, = Oif and only if {h,} is uniformly integrable over E. 
n- E 


Proof If {h,} is uniformly integrable, then, by the Vitali Convergence Theorem, lim, -, oo 
fehn = 0. Conversely, suppose lim, - 0 f eltn = 0. Let « > 0. We may choose a natural 
number N for which f,. hy < «ifn > N. Therefore, since each h, > 0 on E, 


if A C Eis measurable and n > N, then | hy <e. (30) 
A 


According to Propositions 23 and 24, the finite collection {ha} is uniformly integrable 
over E. Let 6 respond to the € challenge regarding the criterion for the uniform integrability 


of (hy»}Na. We infer from (30) that 6 also responds to the ¢ challenge regarding the criterion 
for the uniform integrability of {h,}°C.,. O 


PROBLEMS 
40. Let f be integrable over R. Show that the function F defined by 


F(x)= | f for allxeR 


is properly defined and continuous. Is it necessarily Lipschitz? 
41. Show that Proposition 25 is false if E = R. 
42. Show that Theorem 26 is false without the assumption that the h,,’s are nonnegative. 


43. Let the sequences of functions {h,,} and {g,} be uniformly integrable over E. Show that for 
any a and £, the sequence of linear combinations {af,, + Bgn} also is uniformly integrable 
over E. 


44. Let f be integrable over R and € > 0. Establish the following three approximation properties. 
(i) There is a simple function 7 on R which has finite support and f, | f — | < (Hint: First 
verify this if f is nonnegative. | 
(ii) There is a step function s on R which vanishes outside a closed, bounded interval and 
Jp |f —5| <€. (Hint: Apply part (i) and Problem 18 of Chapter 3.) 


(ii) There is a continuous function g on R which vanishes outside a bounded set and 


In lf -al<e. 


45. Let f be integrable'over E. Define f to be the extension of f to all of R obtained by setting 
f =0 outside of E. Show that f is integrable over R and { cf = Spf. Use this and part (i) and 
(iii) of the preceding problem to show that for « > 0, there is a simple function 7 on E and a 
continuous function g on E for which /{ elf — nl <eand f elf —al<é. 
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46. (Riemann-Lebesgue) Let f be integrable over (—oo, 00). Show that 


47. 


48. 


49. 


50. 


51. 


52. 


0O 
lim | f(x) cosnxdx = 0. 
n>0ooJ_o. 


(Hint: First show this for f is a step function that vanishes outside a closed, bounded interval 
and then use the approximation property (n) of Problem 44.) 


Let f be integrable over (—o0, 00). 
(i) Show that for each f, 


[/ seyar= [7 Her nae 


(n) Let g be a bounded measurable function on R. Show that 


tim, f &(x)-[/)- slr+0)]=0 


(Hint: First show this, using uniform continuity of f on R, if f is continuous and vanishes 
outside a bounded set. Then use the approximation property (iii) of Problem 44.) 


Let f be integrable over E and g be a bounded measurable function on E. Show that f - g is 
integrable over E. 


Let f be integrable over R. Show that the following four assertions are equivalent: 


(i) f=OaeonR. 


(n) tr fg = 0 for every bounded measurable function g on R. 
(iii) {, f =0 for every measurable set A. 
(iv) f, f =0 for every open set O. 


Let F¥ be a family of functions, each of which is integrable over E. Show that F is uniformly 
integrable over E if and only if for each « > 0, there is a 6 > 0 such that for each fe fF, 


[ il<e 


Let F be a family of functions, each of which is integrable over E. Show that F is uniformly 
integrable over E if and only if for each « > 0, there is a 6 > 0 such that for all f € F, 


if A C E is measurable and m(A) <6, then 


if U is open and m(ENU) <6, then lfl <e. 
ENU 


(a) Let F be the family of functions f on [0, 1], each of which is integrable over [0, 1] and 
has h | f| <1. Is F uniformly integrable over [0, 1]? 

(b) Let F be the family of functions f on [0, 1], each of which is continuous on [0, 1] and 
has | f| < 1 on [0, 1]. Is ¥ uniformly integrable over [0, 1]? 

(c) Let F be the family of functions f on [0, 1], each of which is integrable over [0, 1] and 
has {” |f| < b—a forall [a, b] C[0, 1]. Is F uniformly integrable over [0, 1]? 
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In this brief chapter, we first consider a generalization of the Vitali Convergence Theorem 
to sequences of integrable functions on a set of infinite measure; for a pointwise convergent 
sequence of integrable functions, tightness must be added to uniform integrablity in order 
to justify passage of the limit under the integral sign. We then consider a mode of sequential 
convergence for sequences of measurable functions called convergence in measure and 
examine its relationship to pointwise convergence and convergence of integrals. Finally, we 
prove that a bounded function is Lebesgue integrable over a set of finite measure if and 
only if it is measurable, and that a bounded function is Riemann integrable over a closed, 
bounded interval if and only if it is continuous at almost all points in its domain. 


5.1 UNIFORM INTEGRABILITY AND TIGHTNESS: A GENERAL VITALI 
CONVERGENCE THEOREM 


The Vitali Convergence Theorem of the preceding chapter tells us that if m( E) < 00, {fn} is 
uniformly integrable over E and converges pointwise almost everywhere on E to f, then f 
is integrable over E and passage of the limit under the integral sign is justified, that is, 


lim J+]- lim f= | f. (1) 
n> oo E E n>oo E 
This theorem requires that E have finite measure. Indeed, for each natural number n, define 


tn = X{n,n+1) and f =0 on R. Then {f,} is uniformly integrable over R and converges 
pointwise on R to f. However, 


tim, | [6] =190= faim. = fs 


The following property of functions that are integrable over sets of infinite measure suggests 
an additional property which should accompany uniform integrability in order to justify 
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passage of the limit under the integral sign for sequences of functions on a domain of infinite 
measure. 


Proposition 1 Let f be integrable over E. Then for each € > 0, there is a set of finite measure 
Eo for which 
| Ifl<e. 
E~ Ep 


Proof Let « > 0. The nonnegative function | f| is integrable over E. By the definition of the 
integral of a nonnegative function, there is a bounded measurable function g on E, which 
vanishes outside a subset Eo of E of finite measure, for which 0 < g <(|fland / Elf|- f Ee <é. 
Therefore, by the linearity and additivity over domains properties of integration, 


[,_,,"l= J, lias [in-a<e 4 


Definition A family ¥ of measurable functions on E is said to be tight over E provided for 
each € > 0, there is a a subset Eo of E of finite measure for which 


| fl <eforall f eF. 
E~ Ep 


We infer from Proposition 23 of the preceding chapter that if F is a family of functions 
on £ that is uniformly integrable and tight over E, then each function in F is integrable 
over E. 


The Vitali Convergence Theorem Let { fn} be a sequence of functions on E that is uniformly 
integrable and tight over E. Suppose { f,}—> f pointwise a.e. on E. Then f is integrable over 


E and 
in [= [i 
n—> Cw E J 


Proof Let « > 0. By the tightness over E of the sequence { f,}, there is a measurable subset 
Eo of E which has finite measure and 


| | fn| < €/4 for all n. 
E~ Eo 


We infer from Fatou’s Lemma that [ E~ Ey |f| < €/4. Therefore f is integrable over E ~ Ep. 
Moreover, by the linearity and monotonicity of integration, 


[ern 


But Eo has finite measure and { f,} is uniformly integrable over Eo. Therefore, by the Vitali 
Convergence Theorem for functions on domains of finite measure, f is integrable over Eo 
and we may choose an index N for which 


i Uhl 


<| al + | |f| <€/2 for all n. (2) 
E~ Ep E~Eo 


<¢/2 for alln > N. (3) 
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Therefore f is integrable over E and, by (2) and (3), 


[tm-A <eforalln > N. 
E S 


The proof is complete. O 


We leave the proof of the following corollary as an exercise. 


Corollary 2 Let {h,} be a sequence of nonnegative integrable functions on E. Suppose 
{hn (x)}— 0 for almost all x in E. Then 


im h, = 0 if and only if {h,} is uniformly integrable and tight over E. 
n> E 

PROBLEMS 
1. Prove Corollary 2. 


2. Let {fi}f_, be a finite family of functions, each of’ which is integrable over E. Show that 
{ fx}{_, is uniformly integrable and tight over E. 

3. Let the sequences of functions {h,} and {g,} be uniformly integrable and tight over E. Show 
that for any a and B, {a f, + Bgn} also is uniformly integrable and tight over E. 

4. Let {f,,} be a sequence of measurable functions on E. Show that { f,,} is uniformly integrable 


and tight over E if and only if for each e > 0, there is a measurable subset Eo of E that has 
finite measure and a 6 > 0 such that for each measurable subset A of E and index n, 


if m(AN Eq) <6, then | lfnl <e. 
A 


5. Let {f,} be a sequence of integrable functions on R. Show that {f,,} is uniformly integrable 
and tight over R if and only if for each e > 0, there are positive numbers r and 6 such that for 
each open subset O of R and index n, 


ifm(ON(-r, r)) <4, then. l ful <e. 
O 


5.2 CONVERGENCE IN MEASURE 


We have considered sequences of functions that converge uniformly, that converge pointwise, 
and that converge pointwise almost everywhere. To this list we add one more mode of 
convergence that has useful relationships both to pointwise convergence almost everywhere 
and to forthcoming criteria for justifying the passage of the limit under the integral sign. 


Definition Let { f,,} be a sequence of measurable functions on E and f a measurable function 
on E for which f and-each f,, is finite a.e. on E. The sequence { f,} is said to converge in 
measure on E to f provided for each n > 0, 


‘lim m{xeE | |fa(x) - f(2)| > 0} =0. 
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When we write {f,}—> f in measure on E we are implicitly assuming that f and each 
fn 1s measurable, and finite a.e. on E. Observe that if {f,}—> f uniformly on E, and f is a 
real-valued measurable function on E, then {f,}—> f in measure on E since for n > 0, the 
set {xe E||f,(x) — f(x)| > n} is empty for n sufficiently large. However, we also have the 
following much stronger result. 


Proposition 3 Assume E has finite measure. Let { f,} be a sequence of measurable functions 
on E that converges pointwise a.e. on E to f and f is finite ae. on E. Then {f,}—> f in 
measure on E. 


Proof First observe that f is measurable since it is the pointwise limit almost everywhere 
of a sequence of measurable functions. Let n > 0. To prove convergence in measure we let 
€ > 0 and seek an index N such that 


m{xeE| |fu(x) — f(x)|>m} <eforalln > N. (4) 


Egoroff’s Theorem tells us that there is a measurable subset F of E with m(E~ F) < such 
that { f,} > f uniformly on F. Thus there is an index N such that 


lfn —f|<non F foralln > N. 


Thus, for n > N, {x € E||fn(x) — f(x)| >} C E~ F and so (4) holds for this choice of V.O 


The above proposition is false if E has infinite measure. The following example shows 
that the converse of this proposition also is false. 


Example Consider the sequence of subintervals of [0, 1], {J, }p°.,, which has initial terms 
listed as 


[0, 1], [0, 1/2], [1/2, 1], [0, 1/3], [1/3, 2/3], [2/3, 1], 
[0, 1/4], [1/4, 1/2], [1/2, 3/4], [3/4, 1]... 


For each index n, define f,, to be the restriction to [0, 1] of the characteristic func- 
tion of J,. Let f be the function that is identically zero on [0, 1]. We claim that 
{fn}—> f i measure. Indeed, observe that lim,-,..€(I,) = 0 since for each natural 
number m, 


1 
ifn>1+---+m= mn then £( I) <1/m. 


Thus, for 0 <n <1, since {x € E||fn(x) — f(x)| >} Ch, 


O< lim m{xeE£| |fa(x)— f(x)|>n}s lim e(I,) =0. 


However, it is clear that there is no point x in [0, 1] at which { f;,(x)} converges to f(x) since 
for each point x in [0, 1], f(x) = 1 for infinitely many indices n, while f(x) =0. 


Theorem 4 (Riesz) If {f,}—> f in measure on E, then there is a subsequence { fry} that 
converges pointwise a.e. on E to f. 
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Proof By the definition of convergence in measure, there is a strictly increasing sequence of 
natural numbers {n;} for which 


m{xeE | |fj(x)— f(x)|>1/k} < 1/2* for all j > nx. 
For each index k, define 
Ex = {x€E| | fay — f(x) > 1/k}. 
Then m(E;,) < 1/2* and therefore °°, m(Ex) < oo. The Borel-Cantelli Lemma tells 


us that for almost all xe, there is an index K(x) such that xg E, if k > K(x), 
that is, 


lfng (x) — f(x)| < 1/k for all k > K(x). 


Therefore 
slim fu(*) = (2). _ 
Corollary 5 Let {f,,} be a sequence of nonnegative integrable functions on E. Then 
sim, J fu =0 (5) 
if and only if 
{fn} > 0 in measure on E and { f,} is uniformly integrable and tight over E. (6) 


Proof First assume (5). Corollary 2 tells us that {f,} is uniformly integrable and tight over 
E. To show that {f,} > 0 in measure on E, let n > 0. By Chebychev’s Inequality, for each 
index n, 


1 
m{xeE | f>ays— | fh 
TY) JE 
Thus, 
1. 
O< lim m{xeE | fn > mh <— him, abe 
Hence { f,,} > 0 1n measure on E. 


To prove the converse, we argue by contradiction. Assume (6) holds but (5) fails to 
hold. Then there is some €p > 0 and a subsequence {f,,,} for which 


| fn = €0 for all k. 
E 


However, by Theorem 4, a subsequence of {f,,} converges to f =0 pointwise almost 
everywhere on E and this subsequence is uniformly integrable and tight so that, by the Vitali 
Convergence Theorem, we arrive at a contradiction to the existence of the above ¢o. This 
completes the proof. CO 
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PROBLEMS 
6. Let {f,}—> f in measure on E and g be a measurable function on E that is finite a.e. on E. 
Show that {f,} > g in measure on E if and only if f = ga.e. on E. 


7. Let E have finite measure, {f,}—> f in measure on E and g be a measurable function on 
E that is finite a.e. on E. Prove that {f, - g} > f -g in measure, and use this to show that 
{f7} > f? in measure. Infer from this that if {g,} > g in measure, then { fy - gn} > f-g in 
measure. 


8. Show that Fatou’s Lemma, the Monotone Convergence Theorem, the Lebesgue Dominated 
Convergence Theorem, and the Vitali Convergence Theorem remain vahd if “pointwise 
convergence a.e.”” is replaced by “convergence in measure.” 


9. Show that Proposition 3 does not necessarily hold for sets E of infinite measure. 


10. Show that linear combinations of sequences that converge in measure on a set of finite 
measure also converge in measure. 


11. Assume E has finite measure. Let {f,} be a sequence of measurable functions on E 
and f a measurable on E for which f and each f, is finite a.e. on E. Prove that 
{fn} > f in measure on E if and only if every subsequence of {f,} has in turn a further 
subsequence that converges to f pointwise a.e. on E. 


12. Show that a sequence {a;} of real numbers converges to a real number if |aj4; — a;| < 1/2/ 
for all j by showing that the sequence {a;} must be Cauchy. 


13. A sequence { f,} of measurable functions on E is said to be Cauchy in measure provided given 
7 > 0 and € > 0 there is an index N such that for all m,n > N, 


m {xeE | |fn(x) — fin(x)| = n} <€. 


Show that if {f,} is Cauchy in measure, then there is a measurable function f on E to which 
the sequence {f,} converges in measure. (Hint: Choose a strictly increasing sequence of 
natural numbers {n ;} such that for each index j, if Ej = (x € E| | fnj,,(*) — fnj(x)| > 1/2}, 


then m( E;) < 1/2’. Now use the Borel-Cantelli Lemma and the preceding problem.) 
14. Assume m(E) < oo. For two measurable functions g and h on E, define 


lg—hl 
g,h =| —?____., 
p(s, h) e1+(|g—-Al 


Show that { f,}-> f in measure on E if and only if lim, 0 p( fn, f) = 0. 


5.3 CHARACTERIZATIONS OF RIEMANN AND LEBESGUE INTEGRABILITY 


Lemma 6 Let {¢,} and {,} be sequences of functions, each of which is integrable over E, 
such that {,} is increasing while {,} is decreasing on E. Let the function f on E have the 
property that 

On < f <u, on E forall n. 


If 


then 


{Gn} — f pointwise a.e. on E, {i} —> f pointwise a.e. on E, f is integrable over E, 
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lim on= | f and lim =| oF 
n—a © E E sim, [ E 
Proof For x in E, define 

o(x) = lim gn(x) and y*(x) = im n(x). 


The functions are g* and y* properly defined since monotone sequences of extended real- 
valued numbers converge to an extended real number and they are measurable since each is 
the pointwise limit of a sequence of measurable functions. We have the inequalities 


gn <9 < f <W* < yf, on E for all n. (7) 


By the monotonicity and linearity of the integral of nonnegative measurable functions, 


0< | (vo) s | (dn —on) forall, 
E E 
so that 
0 * _ of li n— On) = 0. 
< [we o') < tim, | in —on) 


Since y* — g* is a nonnegative measurable function and [{ £(¥* — g*) = 0, Proposition 9 of 
Chapter 4 tells us that p* = g* a.e. on E. But g* < f < * on E. Therefore 


{gn} > f and {W,}—> f pointwise a.e. on E. 


Therefore f is measurable. Observe that since 0 < f —g; < Wy —g; on E and yy and gy) are 
integrable over E, we infer from the integral comparison test that f is integrable over E. We 
infer from inequality (7) that for all n, 


0<f e-f t= [me N< | n—om) 


and 
o<f s-f m=] (ten) | dn- on) 
E E E E 
and therefore 
lim {| g,= | f= lim | ay. LJ 
na © E i na wo E 


Theorem 7 Let f be a bounded function on a set of finite measure E. Then f is Lebesgue 
integrable over E if and only if it is measurable. 


Proof We have already shown that a bounded measurable function on a set of finite 
measure is Lebesgue integrable (see page 74). It remains to prove the converse. Suppose f 
is integrable. From the equality of the upper and lower Lebesgue integrals we conclude that 
there are sequences of simple functions {y,} and {,} for which 


On < f <u, on E for all n, 
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and 


sim, | fv» - Gn] = 


Since the maximum and minimum of a pair of simple functions are again simple, using 
the monotonicity of integration and by possibly replacing ¢, by max;<j<, 9; and w, by 
min} <j<, #i, We may suppose {¢,} is increasing and {y,} is decreasing. By the preceding 
lemma, {y,}— f pointwise almost everywhere on E. Therefore f is measurable since it is 
the pointwise limit almost everywhere of a sequence of measurable functions. O 


At the very beginning of our consideration of integration, we showed that if a bounded 
function on the closed, bounded interval [a, b] is Riemann integrable over [a, b], then it is 
Lebesgue integrable over [a, b] and the integrals are equal. We may therefore infer from 
the preceding theorem that if a bounded function on [a, b] is Riemann integrable, then it is 
measurable. The following theorem is much more precise. 


Theorem 8 (Lebesgue) Let f be a bounded function on the closed, bounded interval [a, b}. 
Then f is Riemann integrable over |a, b| if and only if the set of points in [a, b| at which f 
fails to be continuous has measure zero. 


Proof We first suppose f is Riemann integrable. We infer from the equality of the upper 
and lower Riemann integrals over [a, b] that there are sequences of partitions {P,,} and {P’} 
of [a, b] for which 


jim [U(F, Pn) — LF, Pn)] =9, 


where U( f, P,) and L( f, P,) upper and lower Darboux sums. Since, under refinement, 
lower Darboux sums increase and upper Darboux sums decrease, by possibly replacing 
each P, by acommon refinement of P;,..., Pn, P;,..., P,, we may assume each P,4; is a 
refinement of P, and P, = P,. For each index n, define ¢, to be the lower step function 
associated with f with respect to P,,, that is, which agrees with f at the partition points of P,, 
and which on each open interval determined by P,, has constant value equal to the infimum 
of f on that interval. We define the upper step function yw, in a similar manner. By definition 
of the Darboux sums, 


b b 
L(f, Pn) =| g, and U( f, Pr) =| W, for all n. 
Then {g,} and {#,} are sequences of integrable functions such that for each index n, 


Qn < f < W, on E. Moreover, the sequence {¢,,} is increasing and {y/,} is decreasing, because 
each P,4; is a refinement of P,,. Finally, 


dim, [dn — ol = im 1 [UC Px) - LS, Pn)] =0. 


We infer from the preceding lemma that 


{~n} > f and {,}— f pointwise a.e on [a, 5]. 
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The set E of points x at which either {y,(x)} or {@n(x)} fail to converge to f(x) has measure 
0. Let Eo be the union of E and the set of all the partition points in the P,’s. As the union 
of a set of measure zero and a countable set, m( Eo) = 0. We claim that f is continuous at 
each point in E ~ Eo. Indeed, let xo belong to E ~ Eo. To show that f is continuous at xo, let 
€ > 0. Since {ifn(xo)} and {yp (xo )} converge to f(xo), we may choose a natural number ng 
for which 


f(x0) — € < Gno(20) S F(x0) S Wng(x0) < f(x0) +. (8) 


Since xo is not a partition point of P,,, we may choose 6 > 0 such that the open interval 
(xo — 6, xo + 8) is contained in the open interval J, determined by P,,, Which contains x0. 
This containment implies that 


if |x — x9| < 6, then @n.(x0) < Gny(x) < f(x) < Wny(x) < Un (x). 
From this inequality and inequality (8) we infer that 
if |x — x9| < 6, then | f(x) — f(x0)| <e. 


Thus f is continuous at xo. 


It remains to prove the converse. Assume f is continuous at almost all points in [a, 5]. 
Let {P,,} be any sequence of partitions of [a, b] for which! 


im. gap P, = 0. 


We claim that 
tim [U(f, Pn) —L(f, Pa)| =0. (9) 


If this is verified, then from the following estimate for the lower and upper Riemann 
integrals, 


Tb b 
o<| f-| f <[UCS, Px) — L(f, Py) for all n, 


we conclude that f is integrable over [a, b]. For each n, let g, and ys, be the lower and upper 
step functions associated with f over the partition P,. To prove (9) is to prove that 


b 
sim, fn ~ en] = 0. (10) 


The Riemann integral of a step function equals its Lebesgue integral. Moreover, since the 
function f is bounded on the bounded set [a, b], the sequences {¢,} and {w,} are uniformly 
bounded on [a, b]. Hence, by the Bounded Convergence Theorem, to verify (10) it suffices 
to show that {¢,}—> f and {#%,}—> f pointwise on the set of points in (a, b) at which f is 
continuous and which are not partition points of any partition P,. Let xp be such a point. We 
show that 


tim ¢n(x0) = flo) and lim n(x0) = f (x0). (11) 


The gap of a partition P is defined to be the maximum distance between consecutive points of the partition. 


106 Chapter 5 Lebesgue Integration: Further Topics 


Let « > 0. Let 5 > 0 be such that 
f (x0) — €/2 < f(x) < f(x) + €/2 if |x — x0| < 6. (12) 


Choose an index N for which gap P, < 6 ifn > N. Ifn > N and J, is the open partition 
interval determined by P,,, which contains xo, then J, C (xp — 8, x9 + 5). We infer from (12) 
that 


f (x0) — €/2 < gn(x0) < f(x0) < Wn(x0) < f(x0) + €/2 
and therefore 


0 < Wn(x0) — f(x0) <€ and 0 < f(x9) — on(xo) <€ for alln > N. 


Thus (11) holds and the proof is complete. L 


PROBLEMS 


15. Let f and g be bounded functions that are Riemann integrable over [a, b]. Show that the 
product fg also is Riemann integrable over [a, 5]. 


16. Let f be a bounded function on [a, b] whose set of discontinuities has measure zero. Show 
that f is measurable. Then show that the same holds without the assumption of boundedness. 

17. Let f be a function on [0, 1] that is continuous on (0, 1]. Show that it is possible for the 
sequence { Sn jn,1] £} to converge and yet f is not Lebesgue integrable over [0, 1]. Can this 
happen if f is nonnegative? 


CHAPTER 6 


Differentiation and Integration 


Contents 
6.1 Continuity of Monotone Functions ............... 0.000 eee 108 
6.2 Differentiability of Monotone Functions: Lebesgue’s Theorem ........ 109 
6.3. Functions of Bounded Variation: Jordan’s Theorem .............. 116 
6.4 Absolutely Continuous Functions ............. 0.0.0 ee eee 119 
6.5 Integrating Derivatives: Differentiating Indefinite Integrals .......... 124 
6.6 ConvexFunctions .... 0... 0.0.0... ee 130 


The fundamental theorems of integral and differential calculus, with respect to the Riemann 
integral, are the workhorses of calculus. In this chapter we formulate these two theorems for 
the Lebesgue integral. For a function f on the closed, bounded interval [a, b], when is 


b 
| f= f(b) — f(a)? (i) 


Assume f is continuous. Extend f to take the value f(b) on (b, b+ 1], and for0 <A <1, 
define the divided difference function Diff, f and average value function Av; f on [a, b] by 


Dit fx) = LE+H =F 


1 x+h 
and Av; f(x) = | f(t) dt for all x in [a, 5]. 
x 
A change of variables and cancellation provides the discrete formulation of (i) for the 
Riemann integral: 


b 
[ditt = av.s(6) - Avis(a), 


The limit of the right-hand side as h > 0* equals f(b) — f(a). We prove a striking theorem 
of Henri Lebesgue which tells us that a monotone function on (a, b) has a finite derivative 
almost everywhere. We then define what it means for a function to be absolutely continuous 
and prove that if f is absolutely continuous, then f is the difference of monotone functions 
and the collection of divided differences, (Diff, fJo<,<1, is uniformly integrable. Therefore, 
by the Vitali Convergence Theorem, (i) follows for f absolutely continuous by taking the 
limit as h — O* in its discrete formulation. If f is monotone and (i) holds, we prove that f 
must be absolutely continuous. From the integral form of the fundamental theorem, (i), we 
obtain the differential form, namely, if f is Lebesgue integrable over [a, b], then 


[ / = f(x) for almost all x in [a, b]. (ii) 
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6.1 CONTINUITY OF MONOTONE FUNCTIONS 


Recall that a function is defined to be monotone if it is either increasing or decreasing. 
Monotone functions play a decisive role in resolving the question posed in the preamble. 
There are two reasons for this. First, a theorem of Lebesgue (page 112) asserts that a 
monotone function on an open interval is differentiable almost everywhere. Second, a 
theorem of Jordan (page 117) tells us that a very general family of functions on a closed, 
bounded interval, those of bounded variation, which includes Lipschitz functions, may be 
expressed as the difference of monotone functions and therefore they also are differentiable 
almost everywhere on the interior of their domain. In this brief preliminary section we 
consider continuity properties of monotone functions. 


Theorem 1 Let f be a monotone function on the open interval (a, b). Then f is continuous 
except possibly at a countable number of points in (a, b). 


Proof Assume f is increasing. Furthermore, assume (a, b) is bounded and f is increasing 
on the closed interval [a, b]. Otherwise, express (a, b) as the union of an ascending sequence 
of open, bounded intervals, the closures of which are contained in (a, b), and take the union 
of the discontinuities in each of this countable collection of intervals. For each xp € (a, b), f 
has a limit from the left and from the right at xo. Define 


F(x) = lim_ f(x) =sup (f(x) | @<x<x0}, 


F(xq) = lim, f(x) = inf { f(x) | xy <x<b}. 


Since f is increasing, f(x ) < f (xj ). The function f fails to be continuous at xo if and only 
if f(x9 ) < f(xg ), in which case we define the open “jump” interval J(x9) by 


J(x0) = YI F229) << FXG}. 


Each jump interval is contained in the bounded interval [f(a), f(b)] and the collection 
of jump intervals is disjoint. Therefore, for each natural number 7, there are only a finite 
number of jump intervals of length greater than 1/n. Thus the set of points of discontinuity 
of f is the union of a countable collection of finite sets and therefore is countable. O 


Proposition 2 Let C be a countable subset of the open interval (a, b). Then there is an 
increasing function on (a, b) that is continuous only at points in (a, b) ~C. 


Proof If C is finite the proof is clear. Assume C is countably infinite. Let {q,}”° 


n=1 be an 
enumeration of C. Define the function f on (a, b) by setting ! 


1 
fixi= > wy for alla<x <b. 


{n|Gn<x} 


1 We use the convention that a sum over the empty-set is zero. 
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Since a geometric series with a ratio less than 1 converges, f is properly defined. Moreover, 


ifa<u<v<b, then f(v)-f(u)= > i (1) 


n 
{n |u<qn<v} 


Thus f is increasing. Let x9 = q; belong to C. Then, by (1), 


f (x0) — f(x) = - for all x < xp. 


Therefore f fails to be continuous at x9. Now let xo belong to (a, b) ~ C. Let n be a natural 
number. There is an open interval J containing xo for which q, does not belong to / for 
1 <k <n. We infer from (1) that | f(x) — f(x0)| < 1/2” for all xe J. Therefore f is 
continuous at xo. 0 


PROBLEMS 


1. Let C be a countable subset of the nondegenerate closed, bounded interval [a, b]. Show that 
there is an increasing function on [a, b] that is continuous only at points in [a, b] ~C. 


2. Show that there is a strictly increasing function on [0, 1] that is continuous only at the 
irrational numbers in [0, 1]. 


3. Let f be a monotone function on a subset E of R. Show that f is continuous except possibly 
at a countable number of points in E. 


4. Let E be asubset of R and C a countable subset of E. Is there a monotone function on E that 
is continuous only at points in E~ C? 
6.2 DIFFERENTIABILITY OF MONOTONE FUNCTIONS: LEBESGUE’S THEOREM 
A closed, bounded interval [c, d] is said to be nondegenerate provided c < d. 
Definition A collection ¥ of closed, bounded, nondegenerate intervals is said to cover a set 


E in the sense of Vitali provided for each point x in E and € > 0, there is an interval I in F 
that contains x and has €(I) <. 


The Vitali Covering Lemma Let E be a set of finite outer measure and F a collection of 


closed, bounded intervals that covers E in the sense of Vitali. Then for each € > 0, there is a 
finite disjoint subcollection {Ix};_, of F for which 


m* 


ex a <e @) 


k=1 


Proof Since m*(E) < oo, there is an open set O containing E for which m(O) < oo. Because 
F is a Vitali covering of E, we may assume that each interval in ¥ is contained in O. By the 
countable additivity and monotonicity of measure, 


oO 
if (1x}72, C F is disjoint, then S) £(1,) < m(O) < oo. (3) 
k=l 
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Moreover, since each I; is closed and F is a Vitali covering of E, 


ie, @) 
if (jg; CF , then E~|(JXC \_) I where F,, =\ree 
k=1 IEF yp 


n 
In J i = i. (4) 
k=1 
If there is a finite disjoint subcollection of F that covers E, the proof is complete. Otherwise, 
we inductively choose a disjoint countable subcollection {1;,}?°., of F which has the following 
property: 
n oo 
E~VkC J 5 « J, for all n, (5) 
k=1 k=n+1 

where, for a closed, bounded interval J, 5 * J denotes the closed interval that has the same 
midpoint as J and 5 times its length. To begin this selection, let J; be any interval in F. 
Suppose n is a natural number and the finite disjoint subcollection {I,}/_, of F has been 
chosen. Since E~ U7 _, 1, #9, the collection F,, defined in (4) is nonempty. Moreover, the 
supremum, s,, of the lengths of the intervals in F ,, is finite since m(O) is an upper bound for 
these lengths. Choose J, ,; to be an interval in ¥,, for which €( 1,41) > s,/2. This inductively 
defines {J,}7°,, a countable disjoint subcollection of F such that for each n, 


n 
€(Ing1) > €(1)/2if le F and IN| J k = 9. (6) 
k=1 

We infer from (3) that {€( J; )} — 0. Fix a natural number n. To verify the inclusion (5), let 
x belong to E~ U;_, 1k. We infer from (4) that there is an J € F which contains x and is 
disjoint from ;_, 1,. Now J must have nonempty intersection with some J;, for otherwise, 
by (6), £(1,) > €(1)/2 for all k, which contradicts the convergence of {£(J;,)} to 0. Let N be 
the first natural number for which JN Iy #@. Then N > n. Since IN UN) I, = 0, we infer 
from (6) that £(1y) > €()/2. Since x belongs to J and IN Iy #9, the distance from x to the 
midpoint of I is at most £(/) + 1/2 - €(In) and hence, since €(I) < 2 - £(Iy), the distance 
from x to the midpoint of Iy is less than 5/2 - (Jy ). This means that x belongs to 5 * Iy. 
Thus, 


0O 
xeSxInC J Sk. 
k=n+1 


We have established the inclusion (5). 


Lete > 0. We infer from (3) that here is a natural number n for which Y7°.,, , , (Jk) 
< €/5. This choice of n, together with the inclusion (5) and the monotonicity and countable 
additivity of measure, establishes (2). CO 


For a real-valued function f and an interior point x of its domain, the upper derivative 
of f atx, Df (x) and the lower derivative of f at x, Df (x) are defined as follows: 


fos $0), 
t 


’ 


Df(x)= fim, c 0<|t\<h 


Df(x)= jim intoapan t= LO) 
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We have Df (x) > Df (x). If Df(x) equals D f(x) and is finite, we say that f is differentiable 
at x and define f’(x) to be the common value of the upper and lower derivatives. 

The Mean Value Theorem of calculus tells us that if a function f is continuous on the 
closed, bounded interval [c, d] and differentiable on its interior (c, d) with f’ > aon(c, d), 
then 


a-(d—c) <[f(d) — f(c)]. 


The proof of the following generalization of this inequality, inequality (7), is a nice illustration 
of the fruitful interplay between the Vitali Covering Lemma and monotonicity properties of 
functions. 


Lemma 3 Let f be an increasing function on the closed, bounded interval [a, b|. Then, for 
eacha > 0, 


m*{xe(a, b)| Df(x) 2 a} < — -[f(b) - f(a)] (7) 


Ril 


and 
m*{xé(a, b)| Df(x) = co} = 0. (8) 


Proof Let a > 0. Define Ey = {x €(a, b)| Df(x) > a}. Choose a’ € (0, a). Let F be the 
collection of closed, bounded intervals [c, d] contained in (a, b) for which f(d) — f(c) > 
a’(d —c). Since Df > a on Eg, F is a Vitali covering of Ey. The Vitali Covering Lemma 
tells us that there is a finite disjoint subcollection {[cx, d,]}j_, of F for which 


n 


Ea™ Ick, a <€. 


k=1 


m* 


Since Ey CUf_;[ck, d&k]U {Ea ~ Us_i ce, deJ}, by the finite subadditivity of outer measure, 
the preceding inequality and the choice of the intervals [c;, d;], 


m*(Ea) <> (de- ex) te <> - SUylde) — flex] +e 0 
k=1 k=1 


However, the function f is increasing on [a, b] and {[cx, dy]}7_, is a disjoint collection of 
subintervals of [a, b]. Therefore 


Yield) ~ Fler] < f(b) - F(a) 
Thus for each ¢ > 0, and each a’ €(0, a), 
m*( Ea) < -Lf(b) - fla] +e 
This proves (7). For each natural number n, {x € (a, b) | Df(x) = 00} C E, and therefore 


m*{x €(a, b)| Df (x) = 00} < m*(E,) < - -( f(b) — f(a)). 


This proves (8). L 
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Lebesgue’s Theorem If the function f is monotone on the open interval (a, b), then it is 
differentiable almost everywhere on (a, b). 


Proof Assume f is increasing. Furthermore, assume (a, b) is bounded. Otherwise, express 
(a, b) as the union of an ascending sequence of open, bounded intervals and use the 
continuity of Lebesgue measure. The set of points x in (a, b) at which Df(x) > Df (x) is 
the union of the sets 


Exp = (r€(a, b)| Df (x) > a>B> Df(x)} 


where a and £ are rational numbers. Hence, since this is a countable collection, by the 
countable subadditivity of outer measure, it suffices to prove that each E,y,g has outer 
measure zero. Fix rationals a, B with a > B and set E = E,,g. Let € > 0. Choose an open set 
O for which 

ECOC(a, b) andm(O) <m*(E) +e. (10) 


Let F be the collection of closed, bounded intervals [c, d] contained in O for which 
f(d)—f(c)<B(d—c). Since Df <Bon E, F isa Vitali covering of E. The Vitali Covering 
Lemma tells us that there is a finite disjoint subcollection {[cx, d,]}/_, of F for which 


m* 


<e. (11) 


n 
E~ (Jlex, de] 
k=1 


By the choice of the intervals [c;,, d,], the inclusion of the union of the disjoint collection 
intervals {[cx, dy ]}f_, in O and (10), 


Ula) — f(cx)] <B > (di a) < B-m(O) <B- [m*(E) +]. (12) 


For 1 < k <n, we infer from the preceding lemma, applied to the restriction of f to [cx, dy], 
that 


m*(EO (ct, de)) < ~Lf(de) ~ F(ca)} 
Therefore, by (11), 


m*(E) < Si m"(EN (cx, dk) $e < Su -sa]te (3) 


k=1 k=1 


We infer from (12) and (13) that 


8 


1 
m*(E) ‘m"(E)+—-¢+e forall > 0. 


Therefore, since 0 < m*( E) < oo and B/a <1, m*(E) =0. O 
Lebesgue’s Theorem is the best possible in the sense that if E is a set of measure zero 


contained in the open interval (a, b), there is an increasing function on (a, b) that fails to 
be differentiable at each point in E (see Problem 10). 
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Remark Frigyes Riesz and Béla Sz.-Nagy* remark that Lebesgue’s Theorem is “one of the 
most striking and most important in real variable theory.” Indeed, in 1872 Karl Weierstrass 
presented mathematics with a continuous function on an open interval which failed to be 
differentiable at any point.’ Further pathology was revealed and there followed a period of 
uncertainty regarding the spread of pathology in mathematical analysis. Lebesgue’s Theorem, 
which was published in 1904, and its consequences, which we pursue in Section 5, helped 
restore confidence in the harmony of mathematics analysis. 


Let f be integrable over the closed, bounded interval [a, b]. Extend f to take the value 
f(b) on (b, b+1]. For 0<h <1, define the divided difference function Diff; f and average 
value function Av, f of [a, b] by 


— X+h 
Diff, f(x) = Fash= fe) and Av; f(x) = : | f for all x € [a, b]. 


By a change of variables in the integral and cancellation, for alla <u <v <b, 
v 
| Diff, f = Av, f(v) — Ava, f(u). (14) 


Corollary 4 Let f be an increasing function on the closed, bounded interval {a, b]. Then f’ is 
integrable over [a, b] and 


[ f’ < f(b) - f(a). (15) 


Proof Since f is increasing on [a, b + 1], it is measurable (see Problem 22) and therefore 
the divided difference functions are also measurable. Lebesgue’s Theorem tells us that 
f is differentiable almost everywhere on (a, b). Therefore {Diff,/, f} is a sequence of 
nonnegative measurable functions that converges pointwise almost everywhere on [a, 5] 
to f’. According to Fatou’s Lemma, 


b 
| f’ < liminf 
a n—-> 0O 


b 
| Dif, / (16) 


By the change of variable formula (14), for each natural number n, since f is increasing, 


b 1 b+1/n 1 +1/n 1 a+1/n 
[ ditint=7. J f-a ff f= s)- 7+ < f(b) — f(a). 


Thus 


b 
lim sup | Diff /n j < f(b) — f(a). (17) 
n—-> Oo a 
The inequality (15) follows from the inequalities (16) and (17). CO] 


2See page 5 of their book Functional Analysis [RSN90]. 
3A simpler example of such a function, due to Bartel van der Waerden, is examined in Chapter 8 of Patrick 
Fitzpatrick’s Advanced Calculus [Fit09]. 
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Remark The integral in (15) is independent of the values taken by f at the endpoints. On 
the other hand, the right-hand side of this equality holds for the extension of any increasing 
extension of f on the open, bounded interval (a, b) to its closure [a, b]. Therefore a tighter 
form of equality (15) is 


fi'< sup f(x)- inf f(x). (18) 
a x€ (a,b) xeé(a,b) 

The right-hand side of this inequality equals f(b) — f(a) if and only if f is continuous at 
the endpoints. However, even if f is increasing and continuous on [a, b], inequality (15) 
may be strict. It is strict for the Cantor- Lebesgue function ¢ on [0, 1] since (1) — 9(0) = 1 
while g' vanishes almost everywhere on (0, 1). We show that for an increasing function f on 
[a, b], (15) is an equality if and only if the function is absolutely continuous on [a, b] (see the 
forthcoming Corollary 12). 


Remark Fora continuous function f on aclosed, bounded interval [a, b] that is differentiable 
on the open interval (a, b), in the absence of a monotonicity assumption on f we cannot infer 
that its derivative f’ is integrable over [a, b]. We leave it as an exercise to show that for f 
defined on [0, 1] by 


_ J x*sin(1/x*) for0<x<1 
se)=4 § forx =0, 


f' is not integrable over [0, 1]. 


PROBLEMS 


5. Show that the Vitali Covering Lemma does not extend to the case in which the covering 
collection has degenerate closed intervals. 


6. Show that the Vitali Covering Lemma does extend to the case in which the covering collection 
consists of nondegenerate general intervals. 


7. Let f be continuous on R. Is there an open interval on which f is monotone? 


8. Let J and J be closed, bounded intervals and y > 0 be such that ¢(/) > y - é( J). Assume 
IN J #9. Show that if y > 1/2, then J C5 * 7, where 5 * J denotes the interval with the same 
center as / and five tunes its length. Is the same true if 0 < y < 1/2? 


9. Show that a set E of real numbers has measure zero if and only if there is a countable 
collection of open intervals {1,}?° , for which each point in E belongs to infinitely many of the 

[,’s and 2%, £(1,) < 00. 
10. (Riesz-Nagy) Let E be a set of measure zero contained in the open interval (a, b). According 
to the preceding problem, there is a countable collection of open intervals contained in (a, b), 


{(cx, dk )}2,, for which each point in E belongs to infinitely many intervals in the collection 
and Sr. 1(dk — cx) < o. Define 
io. @) 
f(x) = ¥ L((cz, de) A(—oo, x)) for all x in (a, b). 
k=1 | 


Show that f is increasing and fails to be differentiable at each point in E. 


11. 


12. 
13. 


14. 


15. 


16. 


17. 
18. 


19. 


20. 


21. 


22. 
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For real numbers a < B and y > 0, show that if g is integrable over [a + y, B + y], then 


B+y 


f serna =| g(t) dt. 


a+y 


Prove this change of variables formula by successively considering simple functions, bounded 
measurable functions, nonnegative integrable functions, and general integrable functions. 
Use it to prove (14). 


Compute the upper and lower derivatives of the characteristic function of the rationals. 


Let E be a set of finite outer measure and F a collection of closed, bounded intervals that 
cover E in the sense of Vitali. Show that there is a countable disjoint collection {J,};°, of 
intervals in F for which 


m* 


ie, ¢) 
E~\J q =0. 
k=1 


Use the Vitali Covering Lemma to show that the union of any collection (countable or 
uncountable) of closed, bounded nondegenerate intervals is measurable. 


Define f on R by 
_ J x sin(1/x) ifx40 
s0)=4 § ifx=0. 
Find the upper and lower derivatives of f at x = 0. 
Let g be integrable over [a, b]. Define the antiderivative of g to be the function f defined on 
[a, b] by 
Xx 
f(x)= | g for all x €[a, b]. 


Show that f is differentiable almost everywhere on (a, b). 


Let f be an increasing bounded function on the open, bounded interval (a, b). Verify (15). 
Show that if f is defined on (a, b) andc € (a, b) is a local minimizer for f, then Df(c) <0 < 


Df (c). 

Let f be continuous on [a, b] with Df > 0 on (a, b). Show that f is increasing on [a, b]. (Hint: 
First show this for a function g for which Dg > « > 0 on (a, b). Apply this to the function 
g(x) = f(x) + ex.) 

Let f and g be real-valued functions on (a, b). Show that 


Df + Dg < D(f +g) < D(f +8) < Df + Dg on (a, d). 


Let f be defined on [a, b] and g a continuous function on [a, f] that is differentiable at 
y€(a, B) with g(y) = ce (a, b). Verify the following. 
(i) Ifg'(y) > 0, then D( f og)(y) = Df(c) - 8’(y). 


(ii) If g’(y) = Oand the upper and lower derivatives of f at c are finite, then D( fog)(y) =0. 


Show that a strictly increasing function that is defined on an interval is measurable and then 
use this to show that a monotone function that is defined on an interval is measurable. 
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_ 23. Show that a continuous function f on [a, b] is Lipschitz if its upper and lower derivatives are 
bounded on (a, b). 


24. Show that for f defined in the last remark of this section, f’ is not integrable over [0, 1]. 


6.3 FUNCTIONS OF BOUNDED VARIATION: JORDAN'S THEOREM 


Lebesgue’s Theorem tells us that a monotone function on an open interval is differentiable 
almost everywhere. Therefore the difference of two increasing functions on an open interval 
also is differentiable almost everywhere. We now provide a characterization of the class 
of functions on a closed, bounded interval that may be expressed as the difference of 
increasing functions, which shows that this class is surprisingly large: it includes, for instance, 
all Lipschitz functions. 

Let f be a real-valued function defined on the closed, bounded interval [a, b] and 
P = {xo, ..., xz} be a partition of [a, b]. Define the variation of f with respect to P by 


k 
VF, P) = DIF (x) — F(%i-1)1, 
i=1 
and the total variation of f on [a, b] by 
TV(f) =sup {V(f, P)| Pa partition of [a, b]}. 


For a subinterval [c, d] of [a, b], TV( fic, a)) denotes the total variation of the restriction of 
f to[c, d]. 


Definition A real-valued function f on the closed, bounded interval [a, b] is said to be of 
bounded variation on [a, b] provided 
TV(f) <0. 


Example Let f be an increasing function on [a, b]. Then f is of bounded variation on [a, b] 
and 


TV(f) = f(b) - f(a). 
Indeed, for any partition P = {xo, ..., x} of [a, D], 


k 


k 
VF, P) = DIF (ai) — f(ai-1) | = Xf (xi) — f(xi-1)] = f(b) - f(a). 


i= 


Example Let f be a Lipschitz function on [a, b]. Then fis of bounded variation of [a, 5], 
and TV( f) <c-(b—a), where 


|f(u) — f(v)| < clu — v| for all u, vin [a, 5]. 


Indeed, for a partition P = {xo, ..., x,} of [a, 5], 


k k 
Vif, P) = > If ai) - f(xi-1)| s+ Si - x1-1] = c- [b - a]. 


i=1 i=1 
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Thus, c - [b — a] is an upper bound of the set of all variations of f with respect to a partition 
of [a, b] and hence TV( f) <c- [b— al. 


Example Define the function f on [0, 1] by 
x cos(w/2x) if0<x<1 
ner 


0 if x = 0. 


Then f is continuous on [0, 1]. But f is not of bounded variation on [0, 1]. Indeed, for 
a natural number 7, consider the partition P, = {0, 1/2n, 1/[2n —1], ..., 1/3, 1/2, 1} of 
[0, 1]. Then 

V(f, Po) =14+1/24+...+1/n. 


Hence f is not of bounded variation on [0, 1], since the harmonic series diverges. 


Observe that if c belongs to (a, b), P is a partition of [a, b], and P’ is the refinement of 
P obtained by adjoining c to P, then, by the triangle inequality, V( f, P) < V(f, P’). Thus, 
in the definition of the total variation of a function on [a, b], the supremum can be taken 
over partitions of [a, b] that contain the point c. Now a partition P of [a, b] that contains the 
point c induces, and is induced by, partitions P; and P» of [a, c] and [c, b], respectively, and 
for such partitions 


V( fia, b}» P) = V( fia,» Pi) + VC fic, oj» P2)- (19) 
Take the supremum among such partitions to conclude that 
TV( ffa,6]) = TV( fla, }) + TV Fe, 0)): © (20) 


We infer from this that if f is of bounded variation on [a, b], then 
TV( fla, y) - TV( fla, u}) =TV( fiu, »)) 2 Oforalla<u<v<b. (21) 


Therefore the function x+> TV( fia, xj), Which we call the total variation function for f, is a 
real-valued increasing function on [a, b]. Moreover, for a < u<v < b, if we take the crudest 
partition P = {u, v} of [u, v], we have 


f(u) — f(v) < If (v) — F(@)| = Vffu, ys P) S TV fou, ) = TV Spa, y) — TV Fa, u)- 
Thus 


f(v) + TV( fla, v}) > f(u)+ TV( fla, u}) foralla<u<v<b. (22) 


We have established the following lemma. 


Lemma 5 Let the function f be of bounded variation on the closed, bounded interval {a, b}. 
Then f has the following explicit expression as the difference of two increasing functions on 
[a, b]: 

f(x) =[F(x) +TV( fa, x1)] -— TV( fa, x) for all x € [a, 5]. (23) 


Jordan’s Theorem A function f is of bounded variation on the closed, bounded interval 
(a, b] if and only if it is the difference of two increasing functions on [a, ]. 
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Proof Let / be of bounded variation on [a, b]. The preceding lemma provides an explicit 
representation of f as the difference of increasing functions. To prove the converse, let 
f = g—h on [a, b], where g and h are increasing functions on [a, b]. For any partition 
P = {xo,..., xx} of [a, b], 


V(f. P)= Bisa) ~ fli) 
= J I[g() - g(xj-1)] + [h(4j-1) - h(x;)]| 
< Blasi) — o(4i-1)1+ 3 fa(ae) — WC) 


= lel) = e(aia)) + BUA) ~ hi) 


=[s(b) — g(a)] + [h(b) — h(a)]. 


Thus, the set of variations of f with respect to partitions of [a, b] is bounded above by 
[e(b) — g(a)] + [A(b) — h(a)] and therefore f is of bounded variation of [a, 5]. O 


We call the expression of a function of bounded variation f as the difference of 
increasing functions a Jordan decomposition of f. 


Corollary 6 If the function f is of bounded variation on the closed, bounded interval [a, b], 
then it is differentiable almost everywhere on the open interval (a, b) and f" is integrable over 
[a, b]. 


Proof According to Jordan’s Theorem, f is the difference of two increasing functions on 
[a, b]. Thus Lebesgue’s Theorem tells us that f is the difference of two functions which are 
differentiable almost everywhere on (a, b). Therefore f is differentiable almost everywhere 
on (a, b). The integrability of f’ follows from Corollary 4. O 


PROBLEMS 


25. Suppose f is continuous on [0, 1]. Must there be a nondegenerate closed subinterval [a, b] 
of [0, 1] for which the restriction of f to [a, b] is of bounded variation? 


26. Let f be the Dirichlet function, the characteristic function of the rationals in (0, 1]. Is f of 
bounded variation on [0, 1]? 


27. Define f(x) = sin x on [0, 27]. Find two increasing functions h and g for which f = h — gon 
[0, 2a]. 


28. Let f be a step function on [a, 5]. Find a formula for its total variation. 
29. (a) Define 
x* cos(1/x2) ifx#0,xe[—1, 1] 
flxyay * SU) ee O-rel 
0 if x = 0. 


Is f of bounded variation on [—1, 1]? 
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(b) Define 
x* cos(1/x) ifx#0,xe[-1, 1 
a(x) = (I/x) [-1, 1] 
0 ifx =0. 
Is g of bounded variation on [—1, 1]? 


30. Show that the linear combination of two functions of bounded variation is also of bounded 
variation. Is the product of two such functions also of bounded variation? 


31. Let P be a partition of [a, b] that is a refinement of the partition P’. For a real-valued function 
f on [a, b], show that V( f, P’) < V(f, P). 


32. Assume f is of bounded variation on [a, b]. Show that there is a sequence of partitions {P,,} 
of [a, b] for which the sequence {TV( f, P, )} is increasing and converges to TV( f). 


33. Let { f,} be a sequence of real-valued functions on [a, b] that converges pointwise on [a, b] to 
the real-valued function f. Show that 


TV(f) <liminf TV(f,). 


34. Let f and g be of bounded variation on [a, b]. Show that 


TV(f +g) <TV(f)+TV(g) and TV(af) =|alTV(f). 
35. For a and B positive numbers, define the function f on [0, 1] by 


x*sin(1/x®) for0<x<1 
f(x) = _ 
0 for x = 0. 


Show that if a > B , then f is of bounded variation on [0, 1], by showing that f’ is integrable 
over [0, 1]. Then show that if a < f, then f is not of bounded variation on [0, 1]. 


36. Let f fail to be of bounded variation on [0, 1]. Show that there is a point xo in [0, 1] such that 
f fails to be of bounded variation on each nondegenerate closed subinterval of [0, 1] that 
contains xo. 


6.4 ABSOLUTELY CONTINUOUS FUNCTIONS 


Definition A real-valued function f on a closed, bounded interval [a, b] is said to be 
absolutely continuous on [a, b] provided for each € > 0, there is a 5 > 0 such that for every 
finite disjoint collection {(ax, by)}f_, of open intervals in (a, b), 


if S [be — a4] <6, then > |f (de) — fae )| <e. 
k=l i=l 


The criterion for absolute continuity in the case the finite collection of intervals consists 
of a single interval is the criterion for the uniform continuity of f on [a, b]. Thus absolutely 
continuous functions are continuous. The converse is false, even for increasing functions. 
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Example The Cantor-Lebesgue function ¢ is increasing and continuous on [0, 1], but it 
is not absolutely continuous (see also Problems 40 and 48). Indeed, to see that ¢ is not 
absolutely continuous, let n be a natural number. At the n-th stage of the construction of 
the Cantor set, a disjoint collection {[cx, dy]}1<,<2n of 2" subintervals of [0, 1] have been 
constructed that cover the Cantor set, each of which has length (1/3)". The Cantor-Lebesgue 
function is constant on each of the intervals that comprise the complement in [0, 1] of this 
collection of intervals. Therefore, since ¢ is increasing and g(1) — 9(0) = 1, 


> [di — cx] = (2/3)" while >) [o(d) — 9(ce)] =1. 


1<k<2" 1<k<2" 


There is no response to the « = 1 challenge regarding the criterion for ¢ to be absolutely 
continuous. - 


Clearly linear combinations of absolutely continuous functions are absolutely continu- 
ous. However, the composition of absolutely continuous functions may fail to be absolutely 
continuous (see Problems 43, 44, and 45). 


Proposition 7 If the function f is Lipschitz on a closed, bounded interval {a, b], then it is 
absolutely continuous on [a, 5]. 


Proof Let c > 0 be a Lipschitz constant for f on [a, b], that is, 
|f(u) — f(v)| < clu — v| for all u, ve[a, 5]. 


Then, regarding the criterion for the absolute continuity of f, it is clear that 6 = €/c responds 
to any € > 0 challenge. 


There are absolutely continuous functions that fail to be Lipschitz: the function f on 
[0, 1], defined by f(x) = ./x for 0 < x < 1, is absolutely continuous but not Lipschitz (see 
Problem 37). 


Theorem 8 Let the function f be absolutely continuous on the closed, bounded interval 
[a, b]. Then f is the difference of increasing absolutely continuous functions and, in particular, 
is of bounded variation. 


Proof We first prove that f is of bounded variation. Indeed, let 5 respond to the « = 1 
challenge regarding the criterion for the absolute continuity of f. Let P be a partition of 
[a, b] into N closed intervals {[c;, dk} each of length less than 6. Then, by the definition 
of 6 in relation to the absolute continuity of f, it is clear that TV( f[-,,4,]) <1, forl <k <n. 
The additivity formula (19) extends to finite sums. Hence 


N 
TV(F) = DTV Fiey.d)) < 
k=1 


Therefore f is of bounded variation. In view of (23) and the absolute continuity of sums 
of absolutely continuous functions, to show that .f is the difference of increasing absolutely 
continuous functions it suffices to show that the total variation function for f is absolutely 
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continuous. Let « > 0. Choose 6 as a response to the ¢/2 challenge regarding the criterion 
for the absolute continuity of f on [a, b]. Let {(cz, dy )}i_, be a disjoint collection of open 
subintervals of (a, b) for which S7_,[d, — cy] < 6. For 1 < k <n, let Py be a partition of 
[cx, d.]. By the choice of 6 in relation to the absolute continuity of f on [a, 5], 


n 
> TV( flex, de> Px) < €/2. 
k=1 
Take the supremum as, for 1 <k <n, Py vary among partitions of [c;, dj], to obtain 
n 
> TV( fic, dl) < €/2 <e.- 
k=1 


We infer from (21) that, for 1 <k <n, TV( fic,,a,]) = TV fla, a,]) — TV(ffa, ej). Hence 


n n 
if [dy — ck] <8, then Irv fia, a,)) - TV(fla.e,))| <€- (24) 
k=1 k=1 
Therefore the total variation function for f is absolutely continuous on [a, b]. O 


Theorem 9 Let the function f be continuous on the closed, bounded interval [a, b]. Then 
f is absolutely continuous on [a, b] if and only if the family of divided difference functions 
{Diff, f}o<n<1 is uniformly integrable over [a, b]. 


Proof. First assume {Diff, f}o<;<; is uniformly integrable over [a, b]. Let « > 0. Choose 
6 > 0 for which 
| Diff, f| < €/2 if m(E) <6 and0<h <1. 
E 


We claim that 6 responds to the ¢ challenge regarding the criterion for f to be absolutely 
continuous. Indeed, let {(cx, dy )}f_, be a disjoint collection of open subintervals of (a, b) 
for which ¥7_ [dk — cx] < 6. For0 <A <1 and1 <k <n, by (14), 


d,, 
Av», f (dk) — Avy, f (cx) = | Diff, f. 


Ck 


Therefore 


n n dy 
S | Avi f(de) — Avi flax) < S| (Diffs fl = [ Diff, 
k=1 


k=1 “CE 


where E =(;_, (cx, d, ) has measure less than 6. Thus, by the choice of 6, 


n 
S | Ava f (dk) — Avan f(cx)| < €/2 for allO<h <1. 
k=1 
Since f is continuous, take the limit as h > 0* to obtain 


n 


S |f(dke) — f(ce)| < /2<e. 


k=1 
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Hence f is absolutely continuous. 


To prove the converse, suppose f is absolutely continuous. The preceding theorem tells 
us that f is the difference of increasing absolutely continuous functions. We may therefore 
assume that f is increasing, so that the divided difference functions are nonnegative. To 
verify the uniformly integrability of (Diff, f}o<j<1, let € > 0. We must show that there is a 
5 > 0 such that for each measurable subset E of (a, b), 


| ditts ¢ <eitm(e) <dand0<h <1. (25) 
E 


According to Theorem 11 of Chapter 2, a measurable set E is contained in a Gs set G 
for which m(G ~ E) = 0. But every Gs set is the intersection of a descending sequence of 
open sets. Moreover, every open set is the disjoint union of a countable collection of open 
intervals, and therefore every open set is the union of an ascending sequence of open sets, 
each of which is the union of a finite disjoint collection of open intervals. Therefore, by the 
continuity of integration, to verify (25) it suffices to find a 6 > 0 such that for {(cx, dk )}/_, a 
disjoint collection of open subintervals of (a, b), 


| Diff, f < ¢/2 if m(E) <6, where E =|_J(cx, di), andO<h <1. (26) 
E k=1 


Choose 6 > 0 as the response to the ¢/2 challenge regarding the criterion for the absolute 
continuity of f on [a, b+ 1]. By a change of variables for the Riemann integral and 
cancellation, 


v 1 h 
| Diff, f = al g(t) dt, where p(t) = f(v+t)—f(u+t) for0<t<landa<u<v<b. 
u 0 


Therefore, if {(cx, dy )}~_, is a disjoint collection of open subintervals of (a, b), 


f vite p= [ata 


where 


n 


E= UI (cis de) and g(t) = XI (dy +t) — f(cx+t)] for allO <¢ <1. 
k=1 kal 


If [d, — cy] < 6, then, for 0 <¢ <1, > [(d, +t) — (cy +t)] <6, and therefore g(t) <«/2. 
k=1 k=1 
Thus 
1 h 
| Diff, f = | g(t)dt<e/2. 
E h Jo 
Hence (26) is verified for this choice of 6. - 


Remark For a nondegenerate closed, bounded interval [a, b], let F rip, F ac, and F gy denote 
the families of functions on [a, b] that are Lipschitz, absolutely continuous, and of bounded 
variation, respectively. We have the following strict inclusions: 


F tip GC Fac CG Fav. (27) 
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Proposition 7 tells us of the first inclusion, and the second inclusion was established in Theorem 
7. Each of these collections is closed with respect to the formation of linear combinations. 
Moreover a function in one of these collections has its total variation function in the same 
collection. Therefore, by (23), a function in one of these collections may be expresed as the 
difference of two increasing functions in the same collection (see Problem 46). 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


PROBLEMS 
Let f be a continuous function on [0, 1] that is absolutely continuous on [e, 1] for each 
O<e<1. 
(i) Show that f may not be absolutely continuous on [0, 1]. 
(u) Show that f is absolutely continuous on [0, 1] if it is increasing. 


(i) Show that the function f on [0, 1], defined by f(x) = ./x for 0 < x < 1, is absolutely 
continuous, but not Lipschitz, on [0, 1]. 


Show that f is absolutely continuous on [a, b] if and only if for each € > 0, there is a 6 > 0 
such that for every countable disjoint collection {(a;, by )}7°., of open intervals in (a, b), 


Ss If( be) — Flax) < S [be ~ a] <6. 


k=1 k=1 


Use the preceding problem to show that if f is increasing on [a, b], then f is absolutely 
continuous on [a, b] if and only if for each e, there is a 5 > 0 such that for a measurable subset 
E of [a, b}, 
m*( f(E)) <eifm(E) <6. 

Use the preceding problem to show that an increasing absolutely continuous function f 
on [a, b] maps sets of measure zero onto sets of measure zero. Conclude that the Cantor- 
Lebesgue function ¢ is not absolutely continuous on [0, 1] since the function w, defined by 
w(x) =x+ (x) for 0 < x <1, maps the Cantor set to a set of measure 1 (page 52). 


Let f be an increasing absolutely continuous function on [a, b]. Use (i) and (ii) below to 

conclude that f maps measurable sets to measurable sets. 

(i) Infer from the continuity of f and the compactness of [a, b] that f maps closed sets to 
closed sets and therefore maps F;, sets to F, sets. 


(ii) The preceding problem tells us that f maps sets of measure zero to sets of measure zero. 
Show that both the sum and product of absolutely continuous functions are absolutely 
continuous. 


Define the functions f and g on [—1, 1] by f(x) = x3 for -1 <x <1and 


_ 2 cos(m/2x) ifx#0,xe[-1, 1] 
a) = ifx=0. 


(i) Show that both f and g are absolutely continuous on [—1, 1]. 
(ii) For the partition P, = {—1, 0, 1/2n, 1/[2n — 1], ..., 1/3, 1/2, 1} of [—1, 1], examine 
Vi fog, Py). 


(1) Show that f o g fails to be of bounded variation, and hence also fails to be absolutely 
continuous, on [—1, 1]. 
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44. Let f be Lipschitz on R and g be absolutely continuous on [a, b]. Show that the composition 
f o gis absolutely continuous on [a, 5]. 


4S. Let f be absolutely continuous on R and g be absolutely continuous and strictly monotone 
on [a, b]. Show that the composition f © g is absolutely continuous on [a, 5]. 


46. Verify the assertions made in the final remark of this section. 


47. Show that a function f is absolutely continuous on [a, b] if and only if for each € > 0, there is 
a 5 > 0 such that for every finite disjoint collection {(a,, bx )}f_, of open intervals in (a, b), 


SUF (b) — f(ae)]| <eif She - a] <3. 
k=1 k=1 


6.5 INTEGRATING DERIVATIVES: DIFFERENTIATING INDEFINITE INTEGRALS 


Let f be a continuous function on the closed, bounded interval [a, b]. In (14), take a = u and 
b = v to arrive at the following discrete formulation of the fundamental theorem of integral 
calculus: | 


b 
[ ditt. = avs7(0) — Ava (a). 


Since f is continuous, the limit of the right-hand side as h > 0+ equals f(b)— f(a). We now 
show that if f is absolutely continuous, then the limit of the left-hand side as h > 0* equals 


f f’ and thereby establish the fundamental theorem of integral calculus for the Lebesgue 
integral.’ 


Theorem 10 Let the function f be absolutely continuous on the closed, bounded interval 


[a, b]. Then f is differentiable a almost everywhere on (a, b),1 its derivative f' is integrable over 
[a, b], and 


fr =s0)- 40). (28) 
Proof We infer from the discrete formulation of the fundamental theorem of integral 


calculus that 


lim 
i= 


b . . 
| [Ditty / = f(b) - F(a). (29) 


Theorem 8 tells us that f is the difference of increasing functions on [a, b] and therefore, by 
Lebesgue’s Theorem, is differentiable almost everywhere on (a, b). Therefore {Diff, Inf} 
converges pointwise almost everywhere on.(a, b) to f’. On the other hand, according to 
Theorem 9, {Diff,/,, f} is uniformly integrable over [a, b]. The Vitali Convergence Theorem 
(page 95) permits passage of the limit under the integral sign in order to conclude that 


b b b 
na © a ; a n> &© a 
Formula (28) follows from (29) and (30). | O 


4This approach to the proof of the fundamental theorem of integral calculus for the Lebesgue inte- 
gral is taken in a note by Patrick Fitzpatrick and Brian Hunt in which Theorem 9 is proven (see www- 
users.math.umd.edu/~pmf/huntpmf). 
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In the study of calculus, indefinite integrals are defined with respect to the Riemann 
integral. We here call a function f on a closed, bounded interval [a, b] the indefinite integral 
of g over [a, b] provided g is Lebesgue integrable over [a, b] and 


f(x)= g(a)+ [gor all x €[a, b]. (31) 


Theorem 11 A function f on a closed, bounded interval {a, b] is absolutely continuous on 
[a, b] if and only if it is an indefinite integral over [a,, b}. 


Proof First suppose f is absolutely continuous on [a, b]. For each x € (a, b], f is absolutely 
continuous over [a, x] and hence, by the preceding theorem, in the case [a, b] is replaced by 


a. x) . 
+f. 


Thus f is the indefinite integral of f’ over [a, b]. 


Conversely, suppose that f is the indefinite integral over [a, b] of g. For a disjoint 
collection {(ax, bx )}_, of open intervals in (a, b), if we define E = Uj_, (ax, by), then, by 
the monotonicity and additivity over domains properties of the integral, 


[oa <2 py , ‘isl = i ial. (32) 


Let € > 0. Since |g| is integrable over [a, b], according to Proposition 23 of Chapter 4, there 
is a 5 > 0 such that f,, |g| < if E C[a, b] is measurable and m(E) < 6. It follows from (32) 
that this same 6 responds to the € challenge regarding the criterion for f to be absolutely 
continuous on [a, 5}. 


n 


% Li(bx) — flae)| = -¥ 
k= 


k=1 


Corollary 12 Let the function f be monotone on the closed, bounded interval [a, b|. Then f 
is absolutely continuous on [a, b| if and only if 


b 
[ f=10)-10). - (33) 
a 
Proof Theorem 10 is the assertion that (33) holds if f is absolutely continuous, irrespective 


of any monotonicity assumption. Conversely, assume f is increasing and (33) holds. Let x 
belong to [a, b]. By the additivity over domains of integration, 


b x b 
= | f-170)-se=|/ fr) stan} +4f f-170)- so} 


According to Corollary 4, 


X b 
[£-We)- Fa) sand fF -17(0)- FM] s 0 
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If the sum of two nonnegative numbers is zero, then they both are zero. Therefore 


f(x) = f(a) + | ft 


Thus f is the indefinite integral of f’. The preceding theorem tells us that f is absolutely 
continuous. a 


Lemma 13 Let f be integrable over the closed, bounded interval [a, b]. Then 


f(x) =0 for almost all x €[a, b] (34) 
if and only if 
[ f =0 for all (x;, x2)  [a, 5]. (35) 


Proof Clearly (34) implies (35). Conversely, suppose (35) holds. We claim that 
| f =0 for all measurable sets E C [a, 5]. (36) 
E 


Indeed, (36) holds for all open sets contained in (a, b) since integration is countably additive 
and every open set is the union of countable disjoint collection of open intervals. The 
continuity of integration then tells us that (36) also holds for all Gs sets contained in (a, b) 
since every such set is the intersection of a countable descending collection of open sets. But 
every measurable subset of [a, b] is of the form G ~ Ep, where G is a Gs subset of (a, b) and 
m( Eo) = 0 (see page 40). We conclude from the additivity over domains of integration that 
(36) is verified. Define 


E* = {xe [a, b]| f(x) >=0} and ET = {xe [a, b]| f(x) <0}. 


These are two measurable subsets of [a, b] and therefore, by (36), 


[r= 7=0and fcn=-[ r=0 


According to Proposition 9 of Chapter 4, a nonnegative integrable function with zero integral 
must vanish almost everywhere on its domain. Thus f* and f~ vanish almost everywhere 
on [a, b] and hence so does f. O 


Theorem 14 Let f be integrable over the closed, bounded interval [a, b]. Then 


a 
dx 


[ / = f(x) for almost all x € (a, b). (37) 


Proof Define the function F on {a, b] by F(x) = ff for all x €[a, b]. Theorem 18 tells us 
that since F is an indefinite integral, it is absolutely continuous. Therefore, by Theorem 10, F 
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is differentiable almost everywhere on (a, b) and its derivative F’ is integrable. According to 
the preceding lemma, to show that the integrable function F’ — f vanishes almost everywhere 
on [a, b] it suffices to show that its integral over every closed subinterval of [a, b] is zero. 
Let [x;,, x2] be contained in [a, b]. According to Theorem 10, in the case [a, b] is replaced 
by [x1, x2], and the linearity and additivity over domains properties of integration, 


[ow -n=[or- [r= R02)- Fn) [9 
=[or-[os-[ oreo 


A function of bounded variation is said to be singular provided its derivative vanishes 
almost everywhere. The Cantor-Lebesgue function is a non-constant singular function. We 
infer from Theorem 10 that an absolutely continuous function is singular if and only if it is 
constant. Let f be of bounded variation on [a, b]. According to Corollary 6, f’ is integrable 
over [a, b]. Define 


O 


=-[ f’ and h(x) = s(x) - fo f’ for all x € [a, b], 


so that 
f =g+hon fa, dj. 
According to Theorem 11, the function g is absolutely continuous. We infer from Theorem 14 
that the function h is singular. The above decomposition of a function of bounded variation 
f as the sum g +h of two functions of bounded variation, where g is absolutely continuous 
and h is singular, is called a Lebesgue decomposition of f. 
ae 
PROBLEMS 
48. The Cantor-Lebesgue function ¢ is continuous and increasing on [0, 1]. Conclude from 


Theorem 10 that ¢ is not absolutely continuous on [0, 1]. Compare this reasoning with that 
proposed in Problem 40. 


49. Let f be continuous on [a, b] and differentiable almost everywhere on (a, b). Show that 
b 
[ £=F0)-1@) 
if and only if 
b Y 
[ [im itun] = tim, Dit]. 


50. Let f be continuous on [a, b] and differentiable almost everywhere on (a, b). Show that if 
{Diff /, f} is uniformly integrable over [a, b], then 


b 
[ #=F0)- Fo. 
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51. Let f be continuous on [a, b] and differentiable almost everywhere on (a, b). Suppose there 
- is a nonnegative function g that is integrable over [a, b] and 


Ditty, f| < ga.e. on [a, b] for all n. 


Show that , 
[1 = $0) - Fa. 


52. Let f and g be absolutely continuous on [a, 5]. Show that 
b b 
[ #-8 = 10)s(0)- sa)s(a)- f F'-e 


53. Let the function f be absolutely continuous on [a, b]. Show that f is Lipschitz on [a, 5] if 
and only if there is a c > 0 for which | f’| < ca.e.on [a, 5]. 


54. (i) Let f be a singular increasing function on [a, b]. Use the Vitali Covering Lemma to show 
that f has the following property: Given « > 0,5 > 0, there is a finite disjoint collection 
{(ax, be )}¢_, of open intervals in (a, b) for which 


Dl ~ ay] <8 and DU(b0) ~ f(ax)] > f(b) — fla) 
= =1 


(11) Let f be an increasing function on [a, b] with the property described in part (i). Show 
that f is singular. 


(iii) Let {f,} be a sequence of singular increasing functions on [a, b] for which the series 
Yh fn(x) converges to a finite value for each x € [a, b]. Define 


NN f(x) = > fala for x €[a, b]. 


Show that f is also singular. 


55. Let f be of bounded variation on [a, b], and define v(x) = TV( fia, x1) for all x € [a, 5]. 
(i) Show that | f’| < v’ a.e on [a, ], and infer from this that 


b 
| If|<TV(f). 


(ii) Show that the above is an equality if and only if f is absolutely continuous on [a, 5]. 
(iii) Compare parts (i) and (ii) with Corollaries 4 and 12, respectively. 


56. Let g be strictly increasing and absolutely continuous on [a, 5]. 
(i) Show that for any open subset O of (a, b), 


m(g(O)) = [ g(x) dx 
(ii) Show that for any Gs subset E of (a, b), 


m(g(E)) = [ of (x) dx. 


57. 


58. 


59. 


60. 
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(ii) Show that for any subset E of [a, b] that has measure 0, its image g( E) also has measure 
0, so that 


m(s(E))=0= | e(s)as 
(iv) Show that for any measurable subset A of [a, 5], 


m(e(A)) = f o'(x)ab 
(v) Let c= g(a) and d = g(b). Show that for any simple function ¢ on [c, d], 


d b 
| e(y)dy = | o(g(x))g(x) dx. 


a 


(vi) Show that for any nonnegative integrable function f over [c, d], 
. d b 
[ rora=f recor wa 
c a 


(vii) Show that part (i) follows from (vi) in the case that f is the characteristic function of O 
and the composition is defined. 


Is the change of variables formula in the last part of the preceding problem true if we just 
assume g Is increasing, not necessarily strictly? | 


Construct an absolutely continuous strictly increasing function f on [0, 1] for which f’ = 0 on 
a set of positive measure. (Hint: Let E be the relative complement in [0, 1] of a generalized 
Cantor set of positive measure and f the indefinite integral of yz. See Problem 39 of Chapter 
2 for the construction of such a Cantor set.) 


For a nonnegative integrable function f over [c, d], and a strictly increasing absolutely 
continuous function g on [a, b] such that g([a, b]) C[c, d], is it possible to justify the change 
of variables formula 


g(b) b 
[ sodav= fF eye e)an 
g(a) a 


by showing that 


(x) x 
ill / ()ds | J (oneal = 0 for almost all x € (a, b)? 


Let f be absolutely continuous and singular on [a, b]. Show that f is constant. Also show 
that the Lebesgue decomposition of a function of bounded variation is unique if the singular 
function is required to vanish at x = a. 
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6.6 CONVEX FUNCTIONS 


Throughout this section (a, ) is an open interval that may be bounded or unbounded. 


Definition A real-valued function ¢ on (a,b) is said to be convex provided for each pair of 
points x1, x2 in (a, b) and each A withO < A <1, 


(Ax + (1—A)x2) < Ap(x1) + (1 —A)G( 22). (38) 


If we look at the graph of g, the convexity inequality can be formulated geometrically by 
saying that each point on the chord between (x1, o(x1)) and (x2, y(x2)) is above the graph 
of ¢. 
Observe that for two points x; < x2 in (a, b), each point x in (x1, x2) may be expres- 
sed as 
x = Ax, +(1—A)x2 where A = canes 
X2— X1 


Thus the convexity inequality may be written as 


o(x) < 


2 Jon) +|-=* | ol) fora <x <x in (a, b). 
x2 — X4 *2~*1 


Regathering terms, this inequality may also be rewritten as 


g(x) —9(x1) _ e(42) - o(*) 


< for x1 <x < x2 in (a, b). (39) 
X—X4 x2 - 


Therefore convexity may also be formulated geometrically by saying that for x] <x <x», the 
slope of the chord from (x;, g(x1)) to (x, o(x)) is no greater than the slope of the chord 
from (x, o(x)) to (x2, o(x2)). 


Proposition 15 /f ¢ is differentiable on (a, b) and its derivative g’ is increasing, then 9 is 
convex. In particular, is convex if it has a nonnegative second derivative g” on (a, b). 


Proof Let x1, x2 be in (a, b) with x; < xp, and let x belong to (x,, x2). We must show that 


e(x)—e(n1) — 9(x2) - (x) 
x-X 7 x2—-xX 


However, apply the Mean Value Theorem to the restriction of » to each of the intervals 
[x1, x] and [x, x2] to choose points c; € (x1, x) and cz € (x, x2) for which 


1 \ . 9(%2) — (x) 
X—X1 (¢2) = X2—-X 


Thus, since ¢’ is increasing, 


Hs) —ol8t) « o(e) < (eq) = I=) : 
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Example Each of the following three functions is convex since each has a nonnegative 
second derivative: 
p(x) =x? on (0, 00) for p> 1; o(x) = e* on (—00, 00); (x) = In(1/x) on (0, 00). 
The following final geometric reformulation of convexity will be useful in the estab- 
lishment of differentiability properties of convex functions. 


The Chordal Slope Lemma _ Let ¢ be convex on (a, b). If x1 <x < x2 belong to (a, b), then 
for py = (x1, 9(m1)), P=(%,9(x)), po = (x2, 9(%2)), 


Slope of pip < slope of Di pz < slope of Dpz. 


Proof Regather terms in the inequality (39) to rewrite it in the following two equivalent 


forms: 
o(x1)~e(x) _ (22) ~ (1) for 5, <a < ap in (a,b) 
x1 -X x2 — X41 
(42) = elm) < 932) = 0%) sor x < ay in (a, b). O 
x2 - x1 x2—X 


For a function g on an open interval (a, b), and point xq € (a, b), if 


8(xo +h) — g(xo) 


exists and is finite, 
h>0,h<0 h 


we denote this limit by g’(xp ) and call it the left-hand derivative of g at xo. Similarly, we 
define g’(xj ) and call it the right-hand derivative of g at xo. Of course, g is differentiable at 
xo if and only if it has left-hand and right-hand derivatives at xg that are equal. The continuity 
and differentiability properties of convex functions follow from the following lemma, whose 
proof follows directly from the Chordal Slope Lemma. 


Lemma 16 Let ¢ be a convex function on (a, b). Then o has left-hand and right-hand 
derivatives at each point x € (a, b). Moreover, for points u,v in (a, b) with u < v, these 
one-sided derivatives satisfy the following inequality: 


g(u-) <¢(u") < ores <¢'(v-) <¢(v"). (40) 


132. Chapter6 Differentiation and Integration 


Corollary 17 Let ~ be a convex function on (a,b). Then is Lipschitz, and therefore 
absolutely continuous, on each closed, bounded subinterval [c, d] of (a, b). 


Proof According to the preceding lemma, for c < u < v <d, 


g'(c")<¢(u") < av) ele) <¢'(v-)<¢(d) (41) 


and therefore 


lp(u) — p(v)| < M|u — v| for all u, v € [c, d], 


where M = max{|g’(c*)|, |p’(d~ )|}. Thus the restriction of @ to [u, v] is Lipschitz. A 
Lipschitz function on a closed, bounded interval is absolutely continuous. O 


We infer from the above corollary and Corollary 6 that any convex function defined 
on an open interval is differentiable almost everywhere on its domain. In fact, much more 
can be said. 


Theorem 18 Let ~ be a convex function on (a, b). Then o is differentiable except at a 
countable number of points and its derivative g' is an increasing function. 


Proof We infer from the inequalities (40) that the functions 
xs f’(x~) and x f’(xT) 


are increasing real-valued functions on (a, b). But, according to Theorem 1, an increasing 
real-valued function is continuous except at a countable number of points. Thus, except 
on a countable subset C of (a, b), both the left-hand and right-hand derivatives of g are 
continuous. Let x9 belong to (a, b)~C. Choose a sequence {x,} of points greater than 
xq that converges to xo. Apply Lemma 16, with x9 = u and x, = v, and take limits 
to conclude that 
(x) Se'(xg) Ss o'(x9)- 

Then ¢'(x9 ) = ¢'(xg) so that ¢ is differentiable at x. To show that g’ is an increasing 
function on (a, b) ~C, let u, v belong to (a, b) ~C with u < v. Then by Lemma 16, 


g(u) < PHO) < oy) . 

Let o be a convex function on (a,b) and xg belong to (a,b). For a real number m, 
the line y = m(x — xo) + ¢(x0), which passes through the point (xo, o(x0)), is called a 
supporting line at xo for the graph of ¢ provided this line always lies below the graph of ¢, 
that is, if 

p(x) => m(x — x9) + (x0) for all x € (a, b). 

It follows from Lemma 16 that such a line is supporting if and only if its slope m lies between 
the left- and right-hand derivatives of y at xo. Thus, in particular, there is always at least one 
supporting line at each point. This notion enables us to give a short proof of the following 
inequality: 
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Jensen’s Inequality Let o be a convex function on (—00, 00), f an integrable function over 
(0, 1], and g o f also integrable over [0,1]. Then 


1 1 
(f pisyas) = (eo saya (42) 


Proof Define a = fh f(x)dx. Choose m to lie between the left-hand and right-hand 
derivative of ¢ at the point a. Then y = m(t — a) + g(a) is the equation of a supporting line 
at (a, o(a)) for the graph of o. Hence 


g(t) > m(t-—a)+ (a) for allteR. 


Since f is integrable over [0, 1], it is finite a.e.on [0, 1] and therefore, substituting f(x) fort 
in this inequality, we have 


o( f(x)) = m( f(x) — a) + o(a) for almost all x €[0, 1]. 


Integrate across this inequality, using the monotonicity of the Lebesgue integral and the 
assumption that both’/ and  o f are integrable over [a, b], to obtain 


1 1 
| o( f(x)) dx > | Im( f(x) — a) +9(a)] dx 
0 0 


=m\ [s(0) 40a 


A few words regarding the assumption, for Jensen’s Inequality, of the integrability of 
go f over [0, 1] are in order. We have shown that a convex function is continuous and 
therefore Proposition 7 of Chapter 3 tells us that the composition ¢ o f is measurable if ¢ is 
convex and f is integrable. If yo f is nonnegative, then it is unnecessary to assume the go f 
is integrable since equality (42) trivially holds if the right-hand integral equals +o. In the 
case yo f fails to be nonnegative, if there are constants c, and cz for which 


+ (a) = 9(a). O 


lp(x)| < cy +2 |x| for all x ER, (43) 


then we infer from the integral comparison test that 9 o f is integrable over (0, 1] if f is. In 
the absence of the growth assumption (43), the function » o f may not be integrable over 
[0, 1] (see Problem 71). 


PROBLEMS 
61. Show that a real-valued function ¢ on (a, b) is convex if and only if for points x1, ..., x, in 
(a, b) and nonnegative numbers A}, ..., A, such that S;_, Ax =1, 


(3 Ak x) < s Ak p( Xx). 
k=l i=l 


Use this to directly prove Jensen’s Inequality for f a simple function. 
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62. 


63. 


69. 


70. 


71. 


Show that a continuous function on (a, b) is convex if and only if 


~ (“5*) < ela) 669) for all x1, x2 € (a, b). 


A function on a general interval J is said to be convex provided it is continuous on J and (38) 
holds for all x1, x2 € J. Is a convex function on a closed, bounded interval [a, b] necessarily 
Lipschitz on [a, b]? 


. Let g have a second derivative at each point in (a, b). Show that g is convex if and only if 9” 


iS nonnegative. 


. Suppose a > 0 and b > 0. Show that the function g(t) = (a+ bt)? is convex on [0, 00) for 


1<p<o. 


. For what functions ¢ is Jensen’s Inequality always an equality? 
67. 
68. 


State and prove a version of Jensen’s Inequality on a general closed, bounded interval [a, 5]. 
Let f be integrable over [0, 1]. Show that 


1 1 
of isd | exp( f(x)) dx 


Let {a,,} be a sequence of nonnegative numbers whose sum is 1 and {£,,} a sequence of positive 


numbers. Show that 
oO oO 
Il ¢ nn Ss >> On on. 


Let g bea positive measurable function on [0, 1]. Show that log ( h g(x) dx) > ik log (g(x)) dx 
whenever each side is defined. 


(Nemytskii) Let » be a continuous function on R. Show that if there are constants for which 
(45) holds, then ¢ o f is integrable over [0, 1] whenever f is. Then show that if go f is 
integrable over [0, 1] whenever f is, then there are constants c, and cz for which (45) holds. 
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Completeness of the real numbers may be formulated by asserting that if {a,} is a sequence 
of real numbers for which lim, m-+90 |an — @m| = 0, there is a real number a for which 
limn—oo |4n — a| = 0. There is a corresponding completeness property for the Lebesgue 
integral. For E measurable and 1 < p < oo, define L?(E) to be the collection of measurable 
functions f for which |f|? is integrable over E; thus L'(E) is the collection of integrable 
functions. If {f,} is a sequence of functions in L?(E) for which 


lim [ it ful? =0, 
n,m>oo E 


there is a function f belonging to L?(E) for which 


Jim i fn — FI? =0. 
This is the Riesz-Fischer Theorem, the centerpiece of this chapter. A collection F of functions 
in L?(E) is said to be dense in L?(E) provided for each g in L?(E) and € > 0, there is a 
function f belonging to F for which /{ rlg — f|? <«¢. We prove that there is a countable 
collection of functions that is dense in L?(E), and that both the continuous functions and 
the simple functions are dense in L?(E). The proofs of the Riesz-Fischer Theorem and the 
denseness results are framed in the context of normed linear spaces of functions. In order 
to construct this frame we prove two basic inequalities, Hdlder’s Inequaliy and Minkowski’s 
Inequality. 


7.1 NORMED LINEAR SPACES 


Throughout this chapter E denotes a measurable set of real numbers. Define F to be the 
collection of all measurable extended real-valued functions on E that are finite a.e. on E. 
Define two functions f and g in F to be equivalent, and write f = g, provided 


f(x) = g(x) for almost all x € E. 
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This is an equivalence relation, that is, it is reflexive, symmetric, and transitive. Therefore 
it induces a partition of F into a disjoint collection of equivalence classes, which we denote 
by #/~. There is a natural linear structure on F/~: given two functions f and-g in f, their 
equivalence classes [ f] and [g] and real numbers a and B, we define the linear combination 
a-[ f]+B-[g] to be the equivalence class of the functions in F that take the value a f (x)+B2(x) 
at points x in E at which both f and g are finite. These linear combinations are properly 
defined in that they are independent of the choice of representatives of the equivalence 
classes. The zero element of this linear space is the equivalence class of functions that vanish 
a.e. on E. 

A subset of a linear space is called a subspace provided it is closed with respect to the 
formation of linear combinations. There is a natural family {L?( E)}; < p<oo of subspaces of 
F/~. For 1 < p< oo, we define L?( E) to be the collection of equivalence classes [f] for 


which 
| Lf|? < oo. 
E 


This is properly defined since if f = g, then [ Elfl? = f ; lgl?. For any two numbers a and 5, 
la + B| < |a| + |b] < 2 max{[al, |d]}, 


and hence 
la + DI? < 2? {la|? + [b|?}. (1) 


We infer from this inequality, together with the linearity and monotonicity of integration, 
that if [f] and [g] belong to L?(E), so also does the linear combination a - [f] + B - [g]. 
Therefore L?(E) is a linear space. Of course, L'(E) comprises equivalence classes of 
integrable functions. 

We call a function f € F essentially bounded provided there is some M > 0, called an 
essential upper bound for /, for which 


|f(x)| < M for almost all x € E. 


We define L®(E) to be the collection of equivalence classes [f] for which f is essentially 
bounded. It is easy to see that this is properly defined and L™(£) also is a linear subspace 
of F/ = 

For simplicity and convenience, we refer to the equivalence classes in ¥ /~ as functions 
and denote them by f rather than [ f]. Thus to write f = g means that f — g vanishes a.e. on 
E. This simplification imposes the obligation to check consistency when defining concepts 
for the L?(E) spaces. For instance, it is meaningful to assert that a sequence { f,} in L?(E£) 
converges pointwise a.e. on F to a function f € L?( EF) since if g, = fy, for alln and f = g, 
then, since the union of a countable collection of sets of measure zero also is of measure 
zero, the sequence {g,} also converges pointwise a.e. on E to g. To state that a function f 
in L?[a, b] is continuous means that there is a continuous function that agrees with f a.e. 
on [a, b]. Since complements of sets of measure zero are dense in R, there is only one such 
continuous function and it is often convenient to consider this unique continuous function 
as the representative of [ f]. 

In the late nineteenth century it was observed that while real-valued functions of one or 
several real variables were the rudimentary ingredients of classical analysis, it is also useful 
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to consider real-valued functions that have as their domain linear spaces of functions: such 
functions were called functionals. It became apparent that in order to systematically study 
such fruitful concepts as convergence of a sequence (and maintain the linearity property of 
convergent sequences) and thereby define the concept of continuous functional, it would 
be useful to extend the concept of absolute value from the real numbers to general linear 
spaces.! The notion’that emerged from these investigations is called a norm. 


Definition Let X be a linear space. A real-valued functional || - || on X is called a norm 
provided for each f and g in X and each real number a, 


(The Triangle Inequality) 
lf+ all < Ifill + lel 


(Positive Homogeneity) 
lofi] = lati fll 


(Nonnegativity) 
I fll = O and || f || = 0 if and only if f = 0. 


By a normed linear space we mean a linear space together with a norm. If X is a linear 
space normed by || - || we say that a function in X is a unit function provided || {|| = 1. For 
any feX, f #0, the function f/|| f|| is a unit function: it is a scalar multiple of f which we 
call the normalization of f. 


Example (the Normed Linear Space L/(E)) For a function f in L'(E), define 
wf = fife 
JE 


Then |] - ||, is a norm on L'(E). Indeed, for f, g€ L'(E), since f and g are finite a.e. on E, 
we infer from the triangle inequality for real numbers that 


Lf + el < |f| + |g| ae. on E. 


Therefore, by the monotonicity and linearity of integration, 


Ite =| f+ < fusi+tel=f iri+ fii fla + lilt. 


Clearly, || - |l1 is positively homogeneous. Finally, if f ¢ L'(E) and || f ||; = 0, then f = Oa. 
on E. Therefore [f] is the zero element of the linear space L! (E) CF/-x, that is, f = 0. 


Example (the Normed Linear Space L®(E)) For a function f in L©(£E), define || f|loo to 
be the infimum of the essential upper bounds for f. We call || f||.o the essential supremum 


1We will see later that continuity can also be examined in relation to metric structures, or, more generally, 
topological structures, on a domain and range of a mapping. 


138 Chapter7 TheLP Spaces: Completeness and Approximation 


of f and claim that || - [lo is a norm on L™(E). The positivity and positive homogeneity 
properties follow by the same arguments used in the preceding example. To verify the 
triangle inequality, we first show that || f||oo is an essential upper bound for f on E, that is, 


Ifl < Ilflloo a.e. on E. (2) 
Indeed, for each natural number n, there is a subset EZ, of E for which 


fl < I flloo + 1/n on E~ E, and m(E,) = 0. 


Hence, if we define Ex. = UP, En, 
fl < If lloo on E~ Ego and m( Eg.) = 0. 


Thus the essential supremum of f is the smallest essential upper bound for f, that is, (2) 
holds. Now for f, g€¢L™(E), 


lf(x) + a(x)1 < IF (x)| + 19(x)1 < I flloo + Wl8lleo for almost all x € E. 


Therefore, || flloo + Ilglloo is an essential upper bound for f + g and hence 
If + 8lloo <= IF lloo + Illoo- 


Example (The Normed Linear Spaces ¢; and £..) There is a collection of normed linear 
spaces of sequences that have simpler structure but many similarities with the L?( E) spaces. 
For 1 < p < oo, define £? to be the collection of real sequences a = (a1, a2,...) for which 


00 
S lax |? <0. 
k=1 


Inequality (1) shows that the sum of two sequences in £? also belongs to £? and clearly a real 
multiple of a sequence in €? also belongs to 2?. Thus £? is a linear space. We define 2~ to be 
the linear space of real bounded sequences. For a sequence a = (aj, a,...) in é', define 


oO 
ada = > lax. 
k=1 
This is anorm on @!. Fora sequence {a;} in €~, define 
ak} lloo = SUP 1<k<00lax. 


It is also easy to see that || - ||,. is anorm on 2%. 


Example (The Normed Linear Space C[a, b]) Let [a, b] be a closed, bounded interval. 
Then the linear space of continuous real-valued functions on [a, b] is denoted by C[a, D]. 
Since each continuous function on [a, b] takes a maximum value, for € C[a, b], we can define 


Il f Ilmax = ele | If(x)I.- 


We leave it as an exercise to show that this defines a norm that we call the maximum norm. 
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PROBLEMS 


b 
If = | fl 


Show that this is a norm on C[a, b]. Also show that there is no number c > 0 for which 


1. For f in C[a, b], define 


Il fllmax < cll fli for all f in C[a, 5], 
but there is ac > 0 for which 


Il fll1 < cll fllmax for all f in C[a, 5]. 


2. Let X be the family of all polynomials with real coefficients defined on R. Show that this 
is a linear space. For a polynomial p, define ||p|| to the sum of the absolute values of the 
coefficients of p. Is this a norm? 


3. For f in L'[a, b], define || f || = f ° x?| f (x)| dx. Show that this is a norm on L}{a, 5]. 
4. For f in L™[a, b], show that 


| loo = min{M | m {xin [a, b]| Lf(x)1>M} =0} 
and if, furthermore, f is continuous on [a, b], that 
Il Flloo = If Ilmax- 


5. Show that £~ and ¢! are normed linear spaces. 


7.2 THE INEQUALITIES OF YOUNG, HOLDER, AND MINKOWSKI 


In the preceding section we introduced the linear spaces L?(E) for 1 < p < oo and Ea 
measurable set of real numbers. In the cases p = 1 and p = 00, we defined a norm on these 
spaces. We now define a norm on L?( E) for 1 < p< oo. 


Definition For E a measurable set, 1 < p < oo, and a function f in L?(E), define 


lfllp = furl 


We will show that the functional || - ||, is a norm on L?(£). Indeed, positive homogeneity is 
clear. Moreover, according to Proposition 9 of Chapter 4, || f |p = 0 if and only if f vanishes 
a.e. on E. Therefore [f] is the zero element of the linear space L'(E) C F/=, that is, f = 0. 
It remains to establish the Triangle Inequality, that is, to show that 


lf + sllp <Ilfllp + Ilgilp for all f, gin L?(E). 


This inequality in not obvious. It is called Minkowski’s Inequality. 
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Definition The conjugate of a number p (1, 00) is the number q = p/(p—1), which is the 
unique number q € (1, oo) for which 


P 4 
The conjugate of 1 is defined to be oo and the conjugate of oo defined to be 1. 


Young’s Inequality For 1 < p < 00, q the conjugate of p, and any two positive numbers a 
and b, 


Proof The function g, defined by g(x) = [1/p]x?+1/q—x for x>0, has a positive derivative 
on (1, oo), a negative derivative on (0, 1), and takes the value 0 at x = 1. The function g 
therefore is nonnegative on (0, oo), that is, 


x < [1/p|x? +1/qifx>0. 

In particular, 
a 
xo < [1/p)xp +1/qifx0= peal’ 

However, this inequality is equivalent to Young’s Inequality, since, because p(q —1) = q, it 
is the inequality obtained by dividing each side of Young’s Inequality by b?. 
Theorem 1 Let E be a measurable set, 1 < p < 00, and q the conjugate of p. If f belongs to 
L?(E) and g belongs to L4( E), then their product f - g is integrable over E and 
Holder’s Inequality 


fife < Ifllp - gla. : (3) 


Moreover, if f #0, the function” f* = Ifllp? -sen( f)-|f |?! belongs to L4(X, w), 
[tre I fllp and || f"llq = 1. (4) 


Proof First consider the case p = 1. Then Hdlder’s Inequality follows from the monotonicity 
of integration and the observation (2) that || f|loo is an essential upper bound for f on E. 
Observe that since f* = sgn( f ), (4) holds with p = 1, g = oo. Now consider p > 1. Assume 
f #0 and g #0, for otherwise there is nothing to prove. It is clear that if Hdlder’s Inequality 
is true when f is replaced by its normalization f/'|| f || and g is replaced by its normalization 
g/\lgllq, then it is true for f and g. We therefore assume that || fllp = Ilgll, = 1, that is, 


| fl? =1and | lel? = 1, 
E E 


The function sgn( f) takes the value 1 if f(x) > 0 and —1 if f(x) <0. Therefore sgn( f)- f = |f| ae. on E 
since f is finite a.e. on E. 
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in which case Hélder’s Inequality becomes 


fif-est. 


Since | f|? and |g|? are integrable over E, f and g are finite a.e. on E. Thus, by Young’s 
Inequality, 


fl? — lel? 
lfgl= Ifl-lgl < pt g eon. 


We infer from the linearity of integration and the integral comparison test that f - g is 


integrable over E and 
1 1 1 1 
fir-siss fise+s [iet== += =1 
E PJE QJE P 4 


It remains to prove (4). Observe that 


f-f*=(flp? -Ifl? ae. on E. 
Therefore 
i f- f= sy. | fl? = WF? PIE = Wf lp. 
E 
Since q(p—1) = p, I f*llq =1. | U 
It is convenient, for feL?(E), f #0, to call the function f* defined above the 
conjugate function of /. 


Minkowski’s Inequality Let E be a measurable set and 1 < p < oo. If the functions f and g 
belong to L?( E), then so does their sum f + g and, moreover, 


lf+ ally < Ifllp + lellp- 


Proof In the preceding section we considered the cases p = 1 and p = ov. So we here 
consider the case pe (1, co). We already inferred from (1) that f + g belongs to L?(E). 
Assume f + ¢ #0. Consider ( f + g)*, the conjugate function of f + g. We infer from the 
linearity of integration and Hélder’s Inequality that 


If tellp= | (f+8)-(F +8)" 
=| p(steys |g (s+8)" 
E E 


< lf llp ; IC f+2)"ll + llellp ; If +3)" llq 
= |lfllp + llallp. 
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The special case of Hélder’s Inequality when p = g = 2 has its own name. 


The Cauchy-Schwarz Inequality Let E be a measurable set and f and g measurable functions 
on E for which f? and g” are integrable over E. Then their product f - g also is integrable over 


E and fun Lelfe 


Corollary 2 Let E be a measurable set and 1 < p < oo. Suppose F is a family of functions in 
L?(E) that is bounded in L?( E) in the sense that there is a constant M for which 


lf llp < M for all f in F. 
Then the family F is uniformly integrable over E. 


Proof Let « > 0. We must show there is a 6 > 0 such that for any f in F, 
| |f| < €if A C E is measurable and m( A) <6. 
A 


Let A be a measurable subset of E of finite measure. Consider L?( A) and L4( A) where q is 
the conjugate of p. Define g to be identically equal to 1 on A. Since m(A) < oo, g belongs 
to L7(A). We infer from Holder’s Inequality, applied to this g and the restriction of f to A, 


that 
n= [uves| fine “fe , 
But for all f in F, 
i ae <| i we) <M and | i of) = im Ay 


Therefore, for all f in F, 
1 
fir <M -[m(A)]"". 


Therefore for each e > 0, 6 = [e/ mM]! responds to the ¢ challenge regarding the criterion for 
F to be uniformly integrable. LJ 


Corollary 3. Let E be a measurable set of finite measure and 1 < p, < p2 < ow. Then 
L?2(E) C L?!(E). Furthermore, 


I fll, Sell flip, for all f in L”(E), (5) 


P2-PY 4 
where c =[m(E)] 1 if pp <0o andc = [m(E)]|”! if pp = 00 
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Proof We leave the case p2 = oo as an exercise. Assume p < oo. Define p = po/ p; > 1 and 
let g be the conjugate of p. Let f belong to L??( E). Observe that f?! belongs to L?( E) and 
& = XE belongs to L7( E) since m(E) < oo. Apply Hélder’s Inequality. Then 


1/q y 
Pr Pp, Pa, q _ p q 
fuse [ise g<IfiP Je WSUS [m(E)]. 


Take the 1/p; power of each side to obtain (5). - O 


Example In general, for E of finite measure and 1 < p; < pp < oo, L??(E) is a proper 
subspace of L?!(£). Forinstance, let E = (0, 1] and f be defined by f(x) = x* for 0<x < 1, 
where —1/p; <a< —1/po. Then fe L?!(E)~L”(E). 


Example In general, for £ of infinite measure, there are no inclusion relationships among 
the L?(E) spaces. For instance, for E = (1, oo) and f defined by 


x2 
= —__ >1 
f(x) i+inx forx > 1, 
f belongs to L?(£) if and only if p = 2. 
PROBLEMS 


6. Show that if Hdlder’s Inequality is true for normalized functions it is true in general. 

7. Verify the assertions in the above two examples regarding the membership of the function f 
in L?(E). 

8. Let f and g belong to L*( £). From the linearity of integration show that for any number A, 


ef pal rer [P= foarreyrzo. 


From this and the quadratic formula directly derive the Cauchy-Schwarz Inequality. 
9. Show that in Young’s Inequality there is equality if and only if a = b = 1. 


10. Show that in Hélder’s Inequality there is equality if and only if there are constants a and , 
not both zero, for which 
: al f|? = Blg|? ae. on E. 


11. For a point x = (x1, x2,...,%,) in R", define 7, to be the step function on the interval 
[1, n + 1) that takes the value x; on the interval [k,k + 1), for 1 < k <n. For p > 1, define 
xl > = 7x llp, the norm of the function 7, in L?[1, n + 1). Show that this defines a norm on 
R". State and prove the Hélder and Minkowski Inequalities for this norm. 


12. For 1 < p< oo and a sequence a = (aj,ap,...) € £?, define T, to be the function on the 
interval [1, oo) that takes the value a; on [k, k +1), fork =1,2,.... Show that 7, belongs 
to L?[1, oo) and that |la||, = ||Tal|p. Use this to state and prove the Hélder and Minkowski 
Inequalities in @?. 

13. Show that if f is a bounded function on E that belongs to L?!( E), then it belongs to L”?(E) 
for any p2 > pj. 
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14. 


15. 
16. 
17. 
18. 
19. 


20. 


21. 


22. 


Show that if f(x) = In(1/x) for x €(0, 1], then f belongs to L?(0, 1] for all 1 < p < 00 but 
does not belong to L™(0, 1]. 


Formulate and prove an extension of Hélder’s Inequality for the product of three functions. 
Suppose that { f;,} is bounded in L![0, 1]. Is { f,} uniformly integrable over [0, 1]? 

For 1 < p < oo, suppose that { f,} is bounded in L?(R). Is { f,} tight? 

Assume m(E) < oo. For f ¢ L®(E), show that lim, oo Il fllp = II flloo- 

For 1 < p < 00, q the conjugate of p, and f « L?( E), show that 


Ifll,p= max | f-8 
p ge LIE), |ighg<l JE 


For 1 < p < 00, q the conjugate of p, and f € L?( E), show that f = 0 if and only if 


| f-g=Ofor all ge L7(E£). 
E 


For 1 < p < ©, find the values of the parameter A for which 


if fe 0 for all fe L?[0, 1]. 


Jim. eA 


(Riesz) For 1 < p < oo, show that if the absolutely continuous function F on [a, b] is the 
indefinite integral of an L?[a, b] function, then there is a constant M > 0 such that for any 
partition {xo, ..., Xn} of [a, bd], 


3 A F(xx_1)|? <M. 


k=1 — 1|P- 


7.3 LP iS COMPLETE: THE RIESZ-FISCHER THEOREM 


The concepts of convergent sequence and Cauchy sequence are defined for a sequence in 
a normed linear space in exactly the same way they are for sequences in R, normed by the 
absolute value. 


Definition A sequence { f,} in a linear space X that is normed by || - || is said to converge to f 
in X provided 


lim || f — fnll = 0. 
n—> 00 


We write 


{fn} > f in X or im fr = finX 


to mean that each f, and f belong to X and limy & oo || f — fall = 9. 
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It is clear that for a sequence {f,} and function f in C[a, b], {f,}—> f in C[a, b], normed 
by the maximum norm, if and only if {f,}—> f uniformly on [a, b]. Furthermore, since the 
essential supremum of a function in L™( E) is an essential upper bound, for a sequence { f;,} 
and function f in L™(E), {fn} f in L©(E) if and only if {f,} > f uniformly on the com- 
plement of a set of measure zero. For a sequence { f,} and function f in L?(E),1 < p< oo, 
{fn} > f in L?(E) if and only if 


| 
lim i fe — fl? =0. 


Definition A sequence { f,} in a linear space X that is normed by || - || is said to be Cauchy in 
X provided for each € > 0, there is a natural number N such that 


ll. fn — mil < € for allm,n> N. 
A normed linear space X is said to be complete provided every Cauchy sequence in X 


converges to a function in X. A complete normed linear space is called a Banach space. 


The completeness axiom for the real numbers is equivalent to the assertion that R, 
normed by the absolute value, is complete. This immediately implies that each Euclidean 
space R” also is complete. In a first course in mathematical analysis it is always proven that 
C[a, b], normed by the maximum norm, is complete (see Problem 31). The same argument, 
together with the measurability of pointwise limits of measurable functions, shows that 
L™(E) also is complete (see Problem 33). 


Proposition 4 Let X be a normed linear space. Then every convergent sequence in X is 
Cauchy. Moreover, a Cauchy sequence in X converges if it has a convergent subsequence. 


Proof Let {f,} > f in X. By the triangle inequality for the norm, 


fn — fmoll =f — SF +EF — foil < Ife — fll + fm — fl for all m, n. 
Therefore { f,} is Cauchy. 


Now let { f,} be a Cauchy sequence in X that has a subsequence { f,} which converges 
in X to f. Lete >0. Since { f,} is Cauchy, we may choose N such that || f; — fll p < €/2 for all 
n,m > N. Since { f,,} converges to f we may choose k such that n, > N and || fn, — fllp < €/2. 
Then, by the triangle inequality for the norm, 


I fn — fllp = Wa — fru) + [fn — fll 
= Wlfn — Snyllp + I fn, — fllp <€ forn > N. 
Therefore { f,}—> f in X L] 
In view of the above lemma, a useful strategy to establish the completeness of a 


particular normed linear space is to show that a particular type of Cauchy sequence, tailored 
to the properties of the space, converges and also show that every Cauchy sequence has 


146 Chapter7 The L? Spaces: Completeness and Approximation 


a subsequence of this particular type (see Problems 30 and 32). In the L?(E) spaces, 
1 < p< oo, so-called rapidly Cauchy sequences,’ defined as follows, are useful. 


Definition Let X be a linear space normed by || - ||. .A sequence { fn} in X is said to be rapidly 
Cauchy provided there is a convergent series of positive numbers Die & for which 


I fert — full < <2 for allk. 


It is useful to observe that if {f,} is a sequence in a normed linear space and the 
sequence of nonnegative numbers {a;} has the property that 


Il fx+1 — Sell < ag for all k, 


then, since 
n+k—-1 


fate — In = >, [fin1 — fi] for all n, k, 
j=n 


n+k—1 oO 
I fn+k -— frill < >> lfj41 — Fill < > a; for all n, k. (6) 
j=n j=n 


Proposition 5 Let X be a normed linear space. Then every rapidly Cauchy sequence in X is 
Cauchy. Furthermore, every Cauchy sequence has a rapidly Cauchy subsequence. 


Proof Let {f,} be a rapidly Cauchy sequence in X and Djp1 €k a convergent series of 
nonnegative numbers for which 


I fext — fell < €2 for all k. (7) 
We infer from (6) that 


0, @) 
Ifntk — fall < >) & for all n, k. (8) 


j=n 


Since the series 7° , €, converges, the series >a eé also converges. We infer from (8) 
that {f,} is Cauchy. Now assume that { f,} is a Cauchy sequence in X. We may inductively 
choose a strictly increasing sequence of natural numbers {n;} for which 


Il fogs: — Fngll < (1/2)* for alll k. 


The subsequence { f,, } is rapidly Cauchy since the geometric series with ratio 1//2 converges. 
L 


Theorem 6 Let E be a measurable set and 1 < p < oo. Then every rapidly Cauchy sequence 
in L?( E) converges both with respect to the L?( E) norm and pointwise a.e. on E to a function 
in L?(E). 


3In the article “Rethinking the Lebesgue Integral” (American Math Monthly, December, 2009), Peter Lax 
singles out pointwise limits of sequences of continuous functions that are rapidly Cauchy with respect to the L! 
norm as primary objects in the construction of the complete space L!. He defends the viewpoint that the principal 
object of desire in the program to use theorems about Banach spaces in the study of integration is the identification 
of L!. Lax constructs functions in L! as limits of rapidly Cauchy sequences of continuous functions without first 
making a separate study of measure theory. 
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Proof We leave the case p = oo as an exercise (Problem 33). Assume 1 < p< oo. Let { f,} be 
a rapidly convergent sequence in L?(E). By possibly excising from E a set of measure zero, 
we may assume that each of the /,,’s takes real values. Choose }\°, «, to be a convergent 
series of positive numbers such that 


Ilfer1 — Sullp < &% for all k, (9) 
and therefore 


| feat — fel? < &? for all k. (10) 
E 


Fix a natural number k. Since, for x € E, | frai(x) — fc(x)| = & if and only if | fp4i(x) - 
fa(x)|? = ef, we infer from (10) and Chebychev’s Inequality that 


m{xeE| \fer(t) ~ fielx)|2 ex} =m {xe E | fist) — fe(x)I? > ef} 
1 
<p: | Vin - Al 


<€}. 


Since p > 1, the series $°°, ef converges. The Borel-Cantelli Lemma tells us that there is a 
subset Eo of E that has measure zero and for each x € E ~ Eo, there is an index K(x) such that 


fea (x) — fe(x)| < € for all k > K(x). 
Let x belong to E ~ Ep. Then 


n+k—1 
|fn+k(x) — fr(x)I Ss > lfjsi(x) — F(x) 
j=n 
< Ss €; for alln > K(x) and all k. (11) 
j=n 


The series })°°, €; converges, and therefore the sequence of real numbers { fi,(x)} is Cauchy. 


The real numbers are complete. Denote the limit of { f;(x)} by f(x). It follows from (9) and 


(6) that 
P 


(o @) 
J ifn —frl? < S é for all n, k. (12) 


j=n 
Since { fn} —> f pointwise a.e. on E, take the limit as k > oo in this inequality and infer from 


Fatou’s Lemma that > 


(o,) 
[t- ni s Ys for all n. 
E jan 
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Since the series r2, e, converges, f belongs to L?(E) and {f,}—> f in L?(E). The proof 
is complete since we constructed f as the pointwise limit a.e. on E of { f,}. L 


The Riesz-Fischer Theorem Let E be a measurable set and 1 < p < oo. Then L?(E) isa 
Banach space. Moreover, if {f,}—> f in L?(E), a subsequence of { f,} converges pointwise 
a.e. on E to f. 


Proof Let {f,} be a Cauchy sequence L?(E). According to Proposition 5, there is a 
subsequence { fn,} of {f,} that is rapidly Cauchy. The preceding theorem tells us that { f,,} 
converges to a function f in L?( £) both with respect to the L?( £) norm and pointwise a.e. 
on E. According to Proposition 4 the whole Cauchy sequence converges to f with respect to 
the L?(E) norm. 


As the following example shows, a sequence { f,} in L?(E) that converges pointwise 
a.e.on E to f in L?(E) will not in general converge in L?(E). 


Example For E = [0, 1], 1 < p< oo, and each natural number n, let f, = ni/P X(0,1/n]- The 
sequence converges pointwise on [0, 1] to the function that is identically zero but does not 
converge to this function with respect to the L?[0, 1] norm. 


The next two theorems provide necessary and sufficient conditions for convergence in L?( E) 
for a sequence that converges pointwise. 


Theorem 7 Let E be a measurable set and 1 < p < oo. Suppose { fy} is a sequence in L?( E) 
that converges pointwise a.e. on E to the function f which belongs to L?( E). Then 


(In}—> f in LP(E) ifand onty tim f fal? =f if, 
n—> & E E 


Proof By possibly excising from E a set of measure zero, we may assume f and each f, 
is real-valued and the convergence is pointwise on all of E. We infer from Minkowski’s 
Inequality that, for each n, ||| fallp — Ilfllp| < llfn — fllp. Hence, if {f,} > fin L?(£), then 
limy > 00 fe lfnl? = fl fnl?. To prove the converse, assume limy oo fr lfnl? = fel fal?. 
Define y(t) = ¢? for all t. Then & is convex since its second derivative is nonnegative and thus 


y (222) <a) + WO) oor alta, b. 
2 2 
Hence 
Pp Pp _ Pp 
<r a bY" tor all a,b. 


Therefore, for each n, a nonnegative measurable function h,, is defined on E by 


hy(x) = alaP +A _ me) f)/ for all xe E. 


NR 
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Since {h,}— |f|? pointwise on E, we infer from Fatou’s Lemma that 


[vse <simis|f 


=iming|f MACS 
E 


= [ ist? —timsup J, 1) 
tim sup | f i <0, 


that is, {f,}—> fin L?(E). O 


fn(x) — f(x) 
2 


fn(x) — f(x) 
2 


Thus 
n(x) — f(x) 
2 


Theorem 8 Let E be a measurable set and 1 < p < oo. Suppose { fn} is a sequence in L?( E) 
that converges pointwise a.e. on E to the function f which belongs to L?( E). Then 


{fn} > fin L?(E) 
if and only if 
{| f|?} is uniformly integrable and tight over E. 


Proof The sequence of nonnegative integrable functions {| f, — f|?} converges pointwise a.e. 
on E to zero. According to Corollary 2 of Chapter 5, a corollary of the Vitali Convergence 
Theorem, 


lim - | fn — f|? = Oif and only if {| f,, — f|?} is uniformly integrable and tight over E. 
n> E 


However, we infer from the inequality (1) that for all n, 


lfn — FIP <2? (fal? + IFIP} and | fal? <2? (\fa — fl? +1 fI?} ae. on E. 


By assumption, | f|? is integrable over E, and therefore {| f, — f|?} is uniformly integrable 
and tight over E if and only if the sequence {| f,,|?} is uniformly integrable and tight over E. 
L 


PROBLEMS 
23. Provide an example of a Cauchy sequence of real numbers that is not rapidly Cauchy. 


24. Let X be normed linear space. Assume that {f,}—> f in X, {g,}—> g in X, and a and B are 
real numbers. Show that 
{a fn + Bn} > af + Bg in X. 


25. Assume that E has finite measure and 1 < p, < p2 < oo. Show that if {f,} > f in L”(E), 
then {f,} > fin L?!(E). 
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26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 
34. 
35. 


(The L? Dominated Convergence Theorem) Let { f,} be a sequence of measurable functions 
that converges pointwise a.e. on E to f. For 1 < p< oo, suppose there is a function gin 
L?(E) such that for all n, | f,| < g a.e. on E. Prove that { f,}—> f in L?(E). 


For E a measurable set and 1 < p < oo, assume {f,}—> f in L?(E). Show that there is a 
subsequence { f,,} and a function g € L?( E) for which | fn, | < g a.e. on E for all k. 


Assume E has finite measure and 1 < p < oo. Suppose {f,} is a sequence of measurable 
functions that converges pointwise a.e. on E to f. For 1 < p<oo, showthat {f,}—> fin L? (E) 
if there is a @ > 0 such that { f,,} belongs to and is bounded as a subset of LP+4(E). 


Consider the linear space of polynomials on [a, b] normed by || - max norm. Is this normed 
linear space a Banach space? 


Let {f,} be a sequence in C[a, b] and 57°, a; a convergent series of positive numbers such 
that 


ll fet — fillmax < ag for all k. 


Prove that 


io. ¢) 
[Fn+k(X) — Fe(x)1 < Wfn+k — fellmax < >) a; for all k,n and all x €[a, 5]. 


J=n 


Conclude that there is a function f € C[a, b] such that { f,}—> f uniformly on [a, 5] 


Use the preceding problem to show that C[a, b], normed by the maximum norm, is a Banach 
space. 


Let {fn} be a sequence in L(E) and 5°, a, a convergent series of positive numbers such 
that 


ll fx+1 — Silloo < ax for all k. 


Prove that there is a subset Ep of E which has measure zero and 


OO 
Ifn+e(x) — fi(x)I < Wfntk — falloo < >) aj for all k,n and all xe E~ Ep. 


j=n 


Conclude that there is a function f ¢ L®(£) such that {f,} > f uniformly on E ~ Ep. 
Use the preceding problem to show that L©(£) is a Banach space. 
Prove that for 1 < p < oo, J? is a Banach space. 


Show that the space of c of all convergent sequences of real numbers and the space co of all 
sequences that converge to zero are Banach spaces with respect to the /° norm. 


7.4 APPROXIMATION AND SEPARABILITY 


We here elaborate on the general theme of Littlewood’s second principle, namely, the 
approximation of functions in one class by ones in a better class. We consider approximation 
with respect to the L?(£) norm. It is useful to introduce the general concept of denseness. 


Definition Let X be a normed linear space with norm || - ||. Given two subsets F and G of X 
with F CG, we say that F is dense in G, provided for each function g in G and «> 0, there is a 
function f in F for which || f — g|| <e. 
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It is not difficult to see that the set F is dense in G if and only if for each g in G there is a 
sequence { f,,} in F for which 
lim fn =gimX. 
n—> 


Moreover, it is also useful to observe that for 7 CGCHCX, 
if F is dense in G and G is dense in H, then F is dense in H. (13) 


We have already encountered dense sets: the rational numbers are dense in R, as are 
the irrational numbers. Moreover, the Weierstrass Approximation Theorem‘ may be stated 
in our present vocabulary of normed linear spaces as follows: the family of polynomials 
restricted to [a, b] is dense in the linear space C[a, b], normed by the maximum norm. 


Proposition 9 Let E be a measurable set and 1 < p < oo. Then the subspace of simple 
functions in L?( E) is dense in L?( E). 


Proof Let g belong to L?(E). First consider p = oo. There is a subset Eg of E of measure 
zero for which g is bounded on E~ Eo. We infer from the Simple Approximation Lemma 
that there is a sequence of simple functions on E ~ Eg that converge uniformly on E ~ Eo 
to g and therefore with respect to the L~(£) norm. Thus the simple functions are dense in 
L®(E). 

Now suppose 1 < p< oo. The function g is measurable and therefore, by the Simple 
Approximation Theorem, there is a sequence {g,} of simple functions on E such that 
{Gn} > g pointwise on E and 
l@n| < |g| on £ for all n. 


It follows from the integral comparison test that each g, belongs to L?(E). We claim that 
{Gn} > gin L?(E). Indeed, for all n, 


lon ~ gl? <2? {lonl? + |gl|?} < 2?* |g]? on E. 


Since |g|? is integrable over FE, we infer from the Lebesgue Dominated Convergence 
Theorem that {¢,}—> gin L?(E). L] 


Proposition 10 Let {a, b] be a closed, bounded interval and 1 < p < oo. Then the subspace 
of step functions on {a, b] is dense in L?[a, b}. 


Proof The preceding proposition tells us that the simple functions are dense in L?[a, bl]. 
Therefore it suffices to show that the step functions are dense in the simple functions, with 
respect to the || - ||, norm. Each simple function is a linear combination of characteristic 
functions of measurable sets. Therefore, if each such characteristic function can be arbitrarily 
closely approximated, in the || - ||) norm, by a step function, since the step functions are a 
linear space, so can any simple function. Let g = y,4, where A is a measurable subset of 
[a, b] and let € > 0, and seek a step function f on [a, b] for which || f — g||p < «. According 
to Theorem 12 of Chapter 2, there is a finite disjoint collection of open intervals, {J;,}/_,, for 


4See Patrick Fitzpatrick’s Advanced Calculus [Fit09] for a proof. 
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which, if we define U = Uj_, i, then the symmetric difference AAU = [A~U]U[UN A] 
has the property that 
m(AAU) <e?. 


Since U/ is the union of a finite disjoint collection of open intervals, yz, is a step function. 
Moreover, 


IIxa — Xullp = [m(AAU)]!/?. (14) 
Therefore ||y4 — xullp < € and the proof is complete. L 


Definition A normed linear space X is said to be separable provided there is a countable 
subset that is dense in X. ) 


The real numbers are separable since the rational numbers are a countable dense 
subset. For [a, b] a closed, bounded interval, C[a, b], normed by the maximum norm, is 
separable since we infer from the Weierstrass Approximation Theorem that the polynomials 
with rational coefficients are a countable set that is dense in C[a, b]. 


Theorem 11 Let E be a measurable set and 1 < p< oo. Then the normed linear space L?(E) 
is separable. 


Proof Let [a, b] be a closed, bounded interval and S[a, b] the collection of step functions 
on [a, b]. Define S’[a, b] to be subcollection of S[a, b] comprising step functions yw on [a, b] 
that take rational values and for which there is a partition P = {xo, ..., xn} of [a, b] with 
w constant on (x,-1, x,), for 1 < k <n, and x, rational for 1 < k <n —1. We infer from 
the density of the rational numbers in the real numbers that S’[a, b] is dense in S[a, b], with 
respect to the L?( E) norm. We leave it as an exercise to verify that S’[a, b] is a countable set. 
There are the following two inclusions, each of which is dense with respect to the L?[a, b] 
norm: 
S'[a, b] C S[a, b] C L?[a, 5]. 


Therefore, by (13), S’[a, b] is dense in L?[a, b]. For each natural number n, define F,, to 
be the functions on R that vanish outside [—n, n] and whose restrictions to [—n, n] belong 
to S'[—n, n]. Define F = U,en Fn. Then F is a countable collection of functions in L?(R). 
By the Monotone Convergence Theorem, 


P p P 
Jim fis [ LF? for all f € L?(R). 


Therefore, by the choice of each F,,, F is a countable collection of functions that is dense in 
L?(R). Finally, let E be a general measurable set. Then the collection of restrictions to E of 
functions in F is a countable dense subset of L?(E), and therefore L?(E) is separable. O 


As the following example shows, in general L©( E) is not separable. 


Example Let ([a, b] be anondegenerate closed, bounded interval. We claim that the normed 
linear space L™[a, b] is not separable. To verify this claim, we argue by contradiction. 
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Suppose there is a countable set {f,}°°, that is dense in L™[a, b]. For each number 
x €[a, b}, select a natural number 7(x) for which 


WX{a, x] ~ Fa(x)lloo < 1/2. 


Observe that 
WX{a, x1] — X{a,x9] loo =lifa <xj <x <b. 


Therefore 7 is a one-to-one mapping of [a, b] onto a set of natural numbers. But a set of 
natural numbers is countable and [a, b] is not countable. We conclude from this contradiction 
that L™[a, b] is not separable. 


For a measurable subset E of R, we denote by C,(E) the linear space of continuous 
real-valued functions on E that vanish outside a bounded set. In the proof of the above 
theorem, for 1 < p < 00, we presented a dense subset F of L?(R) with the property that 
for each f € F, there is a closed, bounded interval [a, b] for which the restriction of f 
to [a, b] is a step function and f vanishes outside [a, b]. It is not difficult to see that each 
f € F is the limit in L?(R) of a sequence of continuous, piecewise linear functions, each 
of which vanish outside a bounded set. Define ¥’ to be the union of all such approximating 
sequences of functions in F. Then ¥’ is dense in L?(R). Moreover, for E a measurable set, 

the collection of restriction to E of functions belonging to F’ is a dense subset of L?(E) 
consisting of continuous functions on E that vanish outside a bounded set. This proves the 
following theorem. 


Theorem 12 Let E be a measurable set and 1 < p< oo. Then C.(E) is dense in L?( E). 


PROBLEMS 


36. Let S be a subset of a normed linear space X. Show that S is dense in X if and only if each 
g €X is the limit of a sequence in S. 


37. Verify (13). 
38. Prove that the collection of polynomials with rational coefficients is countable. 


39. Let E be a measurable set, 1 < p < 00, q the conjugate of p, and S a dense subset of L4(E). 
Show that if g¢ L?(E) and lef -g=Ofor all f eS, then g=0. 


40. Verify the details in the proof of Theorem.11. 


41. Let E be a measurable set of finite measure and 1 < p, < pp < 00. Consider the linear Space 
L”?( E) normed by || - ||p,. Is this normed linear space a Banach space? 


42. Exhibit a measurable set EF for which L™(E) is separable. Show that um E) is not separable 
if the set EF contains a nondegenerate interval. 


43. Suppose that X isa Banach Space with norm || - ||. Let Xo be a dense subspace of X. Assume 
that Xo, when normed by the norm it inherits from X, is also a Banach space. Prove that 
X= Xo. | 


44. For1 < p<o, show that the Sequence space £? is separable. Show that the collection of sets 
of natural numbers is uncountable and conclude that £© is not separable. 
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45. 
46. 


47. 


48. 


49. 


50. 


S1. 


Prove Theorem 12. 


Show that for 1 < p < oo and any two numbers a and b, 


sen(a) - Jal!’ — sgn(b) -\o1!/?|" <2” - a — by 


Show that for 1 < p < oo and any two numbers a and J, 
|sgn(a) - Jal?” — sgn(b) - |bI?| < p- la — bl(lal + |ol)?-!. 


(Mazur) Let E be a measurable set and 1 < p < oo. For f in L'(E), define the function ®( f ) 
on E by 


(f(x) = sent f(x) F(a)”. 
Show that ®( f) belongs to L?( E). Moreover, use Problem 46 to show that 


I®( fF) — B(g)llp <2? - If — gil: forall f, gin L'(E). 


From this conclude that ® is a continuous mapping of L'(£) into L?(£) in the sense that 
if {f,} > f in L'(E), then {®( f, )} > ®( f) in L?(E). Then show that ® is one-to-one and 
its image is L?(E). Find a formula for the inverse mapping. Use the preceding problem to 
conclude that the inverse mapping ®~! is a continuous mapping from L?(E) to L'(E). 


Use the preceding problem to show that the separability of L!( £) implies the separability of 
L?(E), forl < p<oo. 


For [a, b] a nondegenerate closed, bounded interval, show that there is no continuous 
mapping ® from L![a, b] onto L®{a, 5]. 


Use Lusin’s Theorem to prove Theorem 12. 
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For a measurable set E, 1 < p < ov, and q conjugate to p, let g belong to L4( E). Define the 
real-valued functional T on L?( EF) by 


r(f)= | f-eforall fe L"(E). (i) 


Holder’s Inequality tells us that f - g is integrable and therefore T is properly defined. The 
functional T inherits linearity from the linearity of integration. Furthermore, there isa M > 0 
for which 

IT(f)| <M -||fllp for all f € L?(E). (ii) 


Indeed, by Hélder’s Inequality, this holds for M = ||g||z. The Riesz Representation Theorem 
asserts that if T is any real-valued linear functional on L?(E) with the property that there 
is an M for which (ii) holds, then there is a unique g in L4(E) for which T is given by (i). 
A sequence { f,} of functions in L?( E£) to said to converge weakly to a function f in L?(E) 
provided 
tim | fr-g= | f-gforall ge LE). (iii) 
n—->& E E 
We use the Riesz Representation Theorem and a theorem of Helley to show that, for 
1 < p < oo, any bounded sequence in L?(E) has a weakly convergent subsequence. As 
an example of just one of the many consequences of this result, we prove the existence of 
minimizers for certain convex functionals. 


8.1 THE RIESZ REPRESENTATION FOR THE DUAL OF L’, 1 < P < co 


Definition A linear functional on a linear space X is a real-valued function T on X such that 
for g and h in X and a and B real numbers, 


T(a-g+B-h)=a-T(g)+B-T(h). 
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It is easy to verify that the linear combination of linear functionals, defined pointwise, 
is also linear. Thus the collection of linear functionals on a linear space is itself a linear space. 


Example Let E be a measurable set, 1 < p < oo, q the conjugate of p, and g belong to 
L1( E). Define the functional T on L?(E) by 


r()= | ¢- f forall fe"(E), (1) 


Holder’s Inequality tells us that for f ¢ L?(E), the product g - f is integrable over E so the 
functional T is properly defined. By the linearity of integration, T is linear. Observe that 
Holder’s Inequality is the statement that 


IT(F)| < Igllg- Ifllp for all f € L?(E). (2) 
Example Let [a, b] be aclosed, bounded interval and the function g be of bounded variation 


on [a, b]. Define the functional T on C[a, b] by 


b 
T(f) =| f(x) dg(x) for all f € C[a, 5], (3) 


where the integral is in the sense of Riemann-Stieltjes. The functional T is properly defined 
and linear.! Moreover, it follows immediately from the definition of this integral that 


IT(f)| < TV(g) - ll fllmax for all f €C[a, 5], (4) 


where TV( g) is the total variation of g over [a, b]. 


Definition For a normed linear space X, a linear functional T on X is said to be bounded 
provided there is an M > 0 for which 


IT(f)|<M- fll forall f ex. (5) 
The infimum of all such M is called the norm of T and denoted by ||T |\x. 


The inequalities (2) tell us that the linear functional in the first example is bounded, 
while inequalities (4) do the same for the second example. 

Let T be a bounded linear functional on the normed linear space X. It is easy to see 
that (5) holds for M = ||T||,. Hence, by the linearity of T, 


IT(f) —T(A)| < IT lle: If — All for all f, he X. (6) 
From this we infer the following continuity property of a bounded linear functional T: 


if (fn) > f in X, then {T( fn)} > T(f). (7) 


1See Chapter 2 of Richard Wheedon and Antoni Zygmund’s book Measure and Integral [WZ77] regarding 
Riemann-Stieltjes integration. 
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We leave it as an exercise to show that 


Tle =sup {7(f)| fex, I fll <1}, (8) 


and use this characterization of || - ||, to prove the following proposition. 


Proposition 1 Let X be a normed linear space. Then the collection of bounded linear 
functionals on X is a linear space on which || - ||, is anorm. This normed linear space is called 
the dual space of X and denoted by X*. 


Proposition 2 Let E be a measurable set, 1 < p < ov, q the conjugate of p, and g belong to 
L1( E). Define the functional T on L?( E) by 


r(f)= | ¢- fforall fe"), 


Then T is a bounded linear functional on L?( E) and |\T ||, = || 8ll¢- 


Proof We infer from (2) that T is a bounded linear functional on L?( E) and ||T'l|x < |lgllg.On 


the other hand, according to Theorem 1 of the preceding chapter (with p and q interchanged), 
the conjugate function of g, g* = gilt sen(g)|g|?~!, belongs to L?(E), 


T(g*) = llgllg and |lg*l|p = 1. 


It follows from (8) that ||7|. = llgllg- CO 


Our goal now is to prove that for 1 < p < ov, every bounded linear functional on 
L?(E) is given by integration against a function in L4(E), where q is the conjugate of p. 


Proposition 3 Let T and S be bounded linear functionals on a normed linear space X. If 
T = S ona dense subset Xo of X, thenT = S. 


Proof Let g belong to X. Since Xo is dense in X, there is a sequence {g,} in Xp that converges 
in X to g. We infer from (7) that {S( g,)}—> S(g) and {T(gn)} > T(g). But S( gn) = T( gn) 
for alln, and hence S(g) = T(g). O 


Lemma 4 Let E be a measurable set and 1 < p < oo. Suppose the function g is integrable 
over E and there is an M > 0 for which 


[es 


Then g belongs to L4( E), where q is the conjugate of p. Moreover, ||g\|q < M. 


< M\|f\lp for every simple function f in L?(E). (9) 
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Proof Since g is integrable over E, it is finite a.e. on E. By possibly excising a set of measure 
zero from E, we assume g is finite on all of E. We first consider the case p > 1. Since |g| 1s a 
nonnegative measurable function, according to the Simple Approximation Theorem, there 
is a Sequence of simple functions {g,,} that converges pointwise on E to |g| and 0 < @» < |g| 
on E for all n. Since {7} is a sequence of nonnegative measurable functions that converges 
pointwise on E to |g|?, by Fatou’s Lemma, to show that |g|? is integrable over E and 
llgll, < M it suffices to show that 


| g? < M? for alln. (10) 
E 


Fix a natural number n. To verify (10) we estimate the functional values of 97 on E as 
follows: 


94 =9n- gh! <|el- of" = g-sgn(g)- ef! on E. (11) 
We define the simple function f, by 


fn = sgn(g)- 4! on E. 


The function ¢,, is integrable over E since it is dominated on E by the integrable function g. 
Therefore, since ¢, is simple, it has finite support, and hence f, belongs to L?( E). We infer 
from (11) and (9) that 


nS [ e-fe < Ml fnllp. (12) 
E VE 


Since q is the conjugate of p, p(q — 1) = q and therefore 


We rewrite (12) as 


Since || is integrable over E, we may regather this integral inequality as 


1-1/p 
| gf < M, 
E 


which, since 1 — 1/ p = 1/g, is a restatement of (10). 


It remains to consider the case p = 1. We must show that M is an essential upper 
bound for g. We argue by contradiction. If M is not an essential upper bound, then, by the 
continuity of measure, there is some € > 0, for which the set E, = {x€ E||g(x)| > M+} 
has nonzero measure. If we let f be the characteristic function of a measurable subset of E, 
that has finite positive measure, we contradict (9). L 
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Theorem 5 Let [a, b] be a closed, bounded interval and 1 < p < oo. Suppose T is a bounded 
linear functional on L?{a, b]. Then there is a function g in L4[a, b], where q is the conjugate 
of p, for which 


T(f) = | ef forall f in L?a, bl]. 
I 


Proof We consider the case p > 1. The proof of the case p = 1 is similar. For x in [a, 5], 
define 


®(x) = T(X{a,x)). 
We claim that this real-valued function ® is absolutely continuous on [a, b]. Indeed, by the 
linearity of T, for each [c, d] C [a, b], since yj. 4) = Xfa,a) — X{a,c)> 
®(d) — O(c) = T(Xfa,a)) — T(Xfa,c)) = T(X[e,4)): 


Thus if {(a;, by )}7_, is a finite disjoint collection of intervals in (a, b), by the linearity of T, 
S [P(bx) - 8(a1)| = 3 -Txta.n)) = (3 ck tian) " 
k=l k=1 k=l 


where each e, = sgn[®(b; ) — ®( a; )]. Moreover, for the simple function f = > Ek X[ap, by)» 
k=l 


n 


1/p 
IT( A) < IT le: WF ilp and Il fllp = Se 4) 


k=1 
Thus 


n 


n 1/p 
py |D(d;) — O(a; )| < (TI. - Se =) 
k= 


k=1 


Therefore, 6 = (€/||T||..)? responds to any € > 0 challenge regarding the criterion for ® to 
be absolutely continuous on [a, ]. 


According to Theorem 10 of Chapter 6, the function g = ®’ is integrable over [a, b] 
and 


O(x) = g for all x €[a, bd]. 
0 
Therefore, for each [c, d] C (a, b), 


b 
T(xjaa)) = (4) - O(c) = fF -x.4) 


Since the functional 7’? and the functional f> f g- f are linear on the linear space of step 
functions, it follows that 


b 
T(f)= | g- f for all step functions f on [a, DJ. 


2The functional T must respect the equivalence relation of equality a.e. on [a, b] among functions in L?[a, b]. 
In particular, fora <c <d <b, T( XY. ay) = T(X(c,d)) =T(X{c, d]): 
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By Proposition 10 of the preceding chapter and its proof, if f is a simple function on [a, bl], 
there is a sequence of step functions {¢,} that converges to f in L?[a, b] and also is uniformly 
pointwise bounded on [a, 5]. Since the linear functional T is bounded on L?[a, 5], it follows 
from the continuity property (7) that 

‘lim (en) =T(f) 


On the other hand, by the Lebesgue Dominated Convergence Theorem, 
: b b 
lim | e-on= | g:f. 
n> ow a a 


b 
T(f)= | g- f for all simple functions f on [a, 5]. 
a 


Therefore 


Since T is bounded, 


b 
| es = |T(f)| < IIT lx - || fllp for all simple functions f on [a, bj. 


According to Lemma 4, g belongs to L#[a, b]. It follows from Proposition 2 that the linear 
functional f > lr g- f is bounded on L?{a, b]. This functional agrees with the bounded 
functional T on the simple functions, which, by Proposition 9 of the preceding chapter, is a 


dense subspace of L?[a, b]. We infer from Proposition 3 that these two functionals agree on 
all of L?[a, 5]. O 


The Riesz Representation Theorem for the Dual of L’(E) Let E be a measurable set, 
1 < p<oo, and q the conjugate of p. For each g € L4( E), define the bounded linear functional 
Ry on L?( E) by 


Re(f) = |g: f forall fin P(E) (14) 


Then for each bounded linear functional T on L?( E), there is a unique function g € L1( E) 
for which 
R, =T, and ||T lx = lalla. (15) 


Proof Proposition 2 tells us that for each g€ L7(E), Ry is a bounded linear functional on 
L?(E) for which ||R¢||x = ||gllg. By the linearity of integration, for each gi, g2 € L4(E), 


Re _ Reo = Rei-go- 


Thus if Ry, = Rg,, then R,,-», = 0 and hence ||g1 — g2||, = 0 so that g1 = go. Therefore, for 
a bounded linear functional T on L?( E), there is at most one function g € L4( E) for which 
R, =T. It remains to show that for each bounded linear functional T on L?(E), there is a 
function g € L4(E) for which T = R,. The preceding theorem tells us that this is so for E 
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a closed, bounded interval. We now verify this for E = R and then for general measurable 
sets E. 


Let T be a bounded linear functional on L?(R). Fix a natural number n. Define the 
linear functional T,, on L?[—n, n] by 


Tn( f) =T(f) for all f € L?[—n, nl, 


where f is the extension of f to all of R that vanishes outside [—n, n]. Then, since 
lfllp = fll. 
ITn( Ff) | < Tall fllp for all f ¢ L?[—n, n]. 


Thus ||Tnllx < ||Tllx. The preceding theorem tells us there is a function g, EL4[-n, n] 
for which 


Th{ f) =| 8n° f for all fe L?[—n, n] and [lgnllg = Tulle < IT lle. (16) 


By the remarks regarding uniqueness at the beginning of this proof, the restriction of g,41 to 
[—n, n] agrees with g, a.e. on [—n, n]. Define g to be a measurable function on R which, for 
each n, agrees with g, a.e. on [—n, n]. We infer from the definitions of T, and gn, together 
with the left-hand equality in (16), that for all functions f ¢ L?(R) that vanish outside a 
bounded set, 


T(f)= [ g:f. 
R 
By the right-hand inequality in (16), 


n 
| lel? < (IIT I+)? for all n 


and hence, by Fatou’s Lemma, g belongs to L4(R). Since the bounded linear functionals Rg 
and T agree on the dense subspace of L?(R) comprising the L?(R) functions that vanish 
outside a bounded set, it follows from Proposition 3 that R, agrees with T on all of L?(R). 


Finally, consider a general measurable set E and T a bounded linear functional on 
L?(E). Define the linear functional T on L?(R) by T( f) = T( f|z). Then T is a bounded 
linear functional on L?(R). We have just shown that there is a function 3 € L4(R) for which 


T is represented by integration over R against %. Define g to be the restriction of # to E. 
Then T = Ry. L] 


Remark In the second example of this section, we exhibited Lebesgue-Stieltjes integration 
against a function of bounded variation as an example of a bounded linear functional on 
Cla, b]. A theorem of Riesz, which we prove in Chapter 21, tells us that all the bounded 
linear functionals on C[a, b] are of this form. In Section 5 of Chapter 21, we characterize the 
bounded linear functionals on C( K), the linear space of continuous real-valued functions on 
a compact topological space K, normed by the maximum norm. 


Remark Let[a, b] be a nondegenerate closed, bounded interval. We infer from the linearity of 
integration and Hélder’s Inequality that if f belongs to L'{a, b], then the functional go I ? f-g 
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is a bounded linear functional on L™{a, b]. It turns out, however, that there are bounded 
linear functionals on L™[a, b] that are not of this form. In Section 3 of Chapter 19, we prove 
a theorem of Kantorovitch which characterizes the dual of L™. 


PROBLEMS 


1. Verify (8). 


10. 


11. 


. Prove Propositon 1. 


. Let T be a linear functional on a normed linear space X. Show that T is bounded if and only 


if the continuity property (7) holds. 


. A functional T on a normed linear space X is said to be Lipschitz provided there is ac > 0 


such that 
IT(g) —T(h)| < cllg — hil forall g,he xX. 


The infimum of such c’s is called the Lipschitz constant for T. Show that a linear functional is 
bounded if and only if it is Lipschitz, in which case its Lipschitz constant is ||T||x. 


. Let E be a measurable set and 1 < p < oo. Show that the functions in L?(E) that vanish 


outside a bounded set are dense in L?( E). Show that this is false for L°(R). 


. Establish the Riesz Representation Theorem in the case p = 1 by first showing, in the notation 


of the proof of the theorem, that the function ® is Lipschitz and therefore it is absolutely 
continuous. Then follow the p > 1 proof. 


. State and prove a Riesz Representation Theorem for the bounded linear functionals on /?, 


1<p<om. 


. Let c be the linear space of real sequences that converge to a real number and co the subspace 


of c comprising sequences that converge to 0. Norm each of these linear spaces with the £° 
norm. Determine the dual space of c and of co. 


. Let [a, b] be a closed, bounded interval and C[a, b] be normed by the maximum norm. Let 


xq belong to [a, b]. Define the linear functional T on C[a, b] by T( f) = f (xo). Show that 
T is bounded and is given by Riemann-Stieltjes integration against a function of bounded 
variation. 


Let f belong to C[a, b]. Show that there is a function g that is of bounded variation on [a, 5] 
for which 


b 
| f dg =f llmax and TV(f) =1. 


Let [a, b] be a closed, bounded interval and C[a, b] be normed by the maximum norm. Let 
T be a bounded linear functional on C[a, b]. For x € [a, b], let g, be the member of C[a, b] 
that is linear on [a, x] and on [x, b] with g,(a) = 0, g,(x) =x —aand g,(b) = x — a. Define 
®(x) = T(g,) for x € [a, b]. Show that ® is Lipschitz on [a, 5]. 


8.2 WEAK SEQUENTIAL CONVERGENCE IN L? 


The Bolzano-Weierstrass Theorem for the real numbers is the assertion that every bounded 
sequence of real numbers has a convergent sequence. This property immediately extends 
to bounded sequences in each Euclidean space R”. This property fails in an infinite 
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dimensional normed linear space.> In particular, the following example shows that for 
1 < p < o, there are bounded sequences in L?(0, 1] that fail to have any subsequences that 
converge in L?[0, 1]. The functions defined in the following example are called Radamacher 
functions. 


Example For / = [0, 1] and a natural number n, consider the step function f, defined 
on J by | 
fnr(x) = (-1)* for k/2" <x <(k+1)/2" where 0 <k <2" —-1. 


Fix 1 < p < oo. Then {f,} is a bounded sequence in L?(J): indeed, || fnllp < 1 for every 
index n. On the other hand, since, for n #m, | fn — fm| takes the value 2 on a set of measure 
1/2, || fn — fmllp = (2)'~'/?. Therefore no subsequence of { f,} is Cauchy in L? (1) and hence 
no subsequence can converge in L?(I). We also note that no subsequence can converge 
pointwise almost everywhere on / since, for 1 < p <0, if there were such a subsequence, | 
by the Bounded Convergence Theorem it would converge in L?(J). 


Definition Let X be a normed linear space. A sequence { f,} in X is said to converge weakly 
in X to f in X provided 

jim T( fn) =T(f) for all T € X*. 

n— 
We write 

{fn} — fin xX 
to mean that f and each f, belong to X and { f,} converges weakly in X to f. 
We continue to write {f,}— f in X to mean that limn + 0 | fn — f|| = 0 and, to 


distinguish this mode of convergence from weak convergence, often refer to this mode of 
convergence as strong convergence in X. Since 


IT( fn) —T( Ff) =IT( fa — FI < WIT lle: fa — fll for all T € X*, 


if a sequence converges strongly, then it converges weakly. The converse is false. 


Proposition 6 Let E be a measurable set, 1 < p<ovo, and gq the conjugate of p. Then { f,} — f 
in L?( E) if and only if 


nh >OO 


lim | g- fh = | &- F forall ge1%(E) 
E E 


Proof The Riesz Representation Theorem tells us that every bounded linear functional on 
L?( E) is given by integration against a function in L4( E). - L] 


For £ a measurable set and 1 < p < oo, a sequence in L?( E) can converge weakly to 
at most one function in L?(£). Indeed, suppose { f,} converges weakly in L?(E) to both f; | 
and f2. Consider the conjugate function of f; — fy, (fi — f2)*. Then 


[py t= tim, [nn th= [i= pyr 
E E E 


3Riesz’s Theorem, which we prove in Section 3 of Chapter 13, tells us that in every infinite dimensional normed 
linear space X, there is a bounded sequence that has no subsequence that converges in X. 
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Therefore 


lA - fallp = Ke - hy (A- fr) =0. 
Thus f; = f2 and therefore weak sequential limits are unique. 
Theorem 7 Let E be a measurable set and 1 < p < ov. Suppose { fn} — f in L?(E). Then 
{ fn} is bounded in L?( E) and || f||p < liming || fillp- (17) 


Proof Let g be the conjugate of p and f* the conjugate function of f. We first establish the 
right-hand inequality of (17). We infer from Hélder’s Inequality that 


[ir ‘fn SMF Iq: Wfallp = Wfallp for all n. 


Since { f,} converges weakly to f and f* belongs to L4(E), 


n—-> 0 


Ifly= | f-f= lim ff f= timing I fly 


We argue by contradiction to show that { f,} is bounded in L?(E). Assume {|| fn || p} is 
unbounded. Without loss of generality (see Problem 18), by possibly taking scalar multiples 
of a subsequence, we suppose 


fnllp =n - 3” for all n. (18) 


We inductively select a sequence of real numbers {«,} for which «, = +1/3* for each k. 
Define €; = 1/3. If n is a natural number for which €1, ..., €, have been defined, define 


€n+1 = 1/3"*! if i » €( fn y *fr4i 2 9, 
k=1 


and €,,; = —1/3"*! if the above integral is negative. Therefore, by (18) and the definition 
of conjugate function, 


| » a Sn 
E | k=1 


Since |lex - (fe )*llq = 1/3* for all k, the sequence of partial sums of the series 3%, ex « ( fx)* 
is a Cauchy sequence in L4( EF). The Riesz-Fischer Theorem tells us that L4( E) is complete. 
Define the function g € L4( EF) by 


> 1/3" fallp = and |len - (fa) *Ilg = 1/3” for all n. (19) 


g= Dea (h)*. 
k=1 
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Fix a natural number n. We infer from the triangle inequality, (19), and Hdlder’s Inequal- 


ity that 
/ & Sn 
E 


-|/ Sa-cnr]-s 
E | k=1 
[\S<-un'|-s -|f >) c(h 
E |k=1 E | k=n+1 
[\3 «(Hf 
E | k=n+1 


>n— | »> vst “WWnllp 
=n+1 


=n—1/3" -1/2-||fallp 
=n/2. 


> 


zn 


This is a contradiction because, since the sequence {f,} converges weakly in L?(E) and 
g belongs to L4(E), the sequence of real numbers {7g - f,} converges and therefore is 
bounded. Hence { f,,} is bounded in L?. L] 


Corollary 8 Let E be a measurable set, 1 < p < 00, and q the conjugate of p. Suppose { fn} 
converges weakly to f in L?( E) and {g,} converges strongly to g in L4(E). Then 


lim in fe= | ef (20) 
E E 


n> Ww 


Proof For each index n, 


[ee te- [e-t= [leant fe t-fes. 
E E E E E 


According to the preceding theorem, there is a constant C > 0 for which 
Ifallp < C for all n. 


fet fos 


im. lig — gllg =Oand lim fem=fes 
— n—> CW E E 


it follows that (20) holds. LI 


Therefore, by Hélder’s Inequality, 


[ete fet 
E E 


From these inequalities and the fact that both 


<C-|lgn — gllg+ for all n. 
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By the linear span of a subset S of a linear space X we mean the linear space consisting 
of all linear combinations of functions in S, that is, the linear space of functions of the form 


f=> om fe 
k=1 


where each a, is a real number and each f; belongs to S. 


Proposition 9 Let E be a measurable set, 1 < p < 00, and q the conjugate of p. Assume F 
is a subset of L4(E) whose linear span is dense in L4( E). Let { f,} be a bounded sequence in 
L?(E) and f belong to L?(E). Then { f,} — f in L?(E) if and only if 


lim | fp: g= [ ¢-g forall geF. (21) 
E JE 


n> © 


Proof Proposition 6 characterizes weak convergence in L?(E). Assume (21) holds. To verify 
weak convergence, let gq belong to L4(E). We show that limn-; oo f edn 80=Jnf - 80. Let 
€ > 0. We must find a natural number N for which 


| te-eo- f -8 


Observe that for any g € L4( E) and natural number n, 


| u-s0- | f-80= | (fe-F)-(00-8)+ [ (m-S)-8. 
E E E E 
and therefore, by Hélder’s Inequality, 


ff t-e0-f F- 


Since { f,} is bounded in L?(£) and the linear span of F is dense in L4( E), there is a function 
g in this linear span for which 


<ecifn>N. (22) 


<lIfe~ filp-lle~aolle+|ff foe ft) 


Ilfn — Fly - \18 — 8ollg < €/2 for all n. 


We infer from (21), the linearity of integration, and the linearity of convergence for sequences 
of real numbers, that 


dim, [ t-e= [4-0 


Therefore there is a natural number N for which 


[tee] F-6|<e(2itne w. 


By the preceding estimates it is clear that (22) holds for this choice of N. UO 
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According to Proposition 9 of the preceding chapter, for 1 <q < oo, the simple 
functions in L4(E) are dense in L4(E), and these functions have finite support if g < oo. 
Moreover, Proposition 10 of the same chapter tells us that for a closed, bounded interval 
[a, b] and 1 < q <0, the step functions are dense in L4 [a, b]. Therefore the following two 
characterizations of weak continuity follow from the preceding proposition. 


Theorem 10 Let E be a measurable setand1 < p<oo. Suppose { f,} is a bounded sequence in 
L?(E) and f belongs to L?(E). Then { f,} —~ f in LP (E) if and only if for every measurable 


subset A of E, 
n> Ww A A 


If p > 1, it is sufficient to consider sets A of finite measure. 


Theorem 11 Let [a, b] be a closed, bounded interval and 1 < p < oo. Suppose {fy} is a 
bounded sequence in L?[a, b] and f belongs to L?[a, b]. Then { f,} — f in L?[a, b] if and 
only if | 


x X 
lim J a = | f for all x in [a, bj. (24) 
naw a a 
Theorem 11 is false for p = 1, since the step functions are not dense in L™[a, b]: see Problem 44. 


Example (the Riemann-Lebesgue Lemma) Let J = [—7, a] and 1 < p< oo. For each 
natural number n, define f,(x) = sinnx for xin J. Then lfn| < 1 on J for each n, so {f,} 
is a bounded sequence in L?(I). The preceding corollary tells us that the sequence { f,} 
converges weakly in L?(I) to f =0 if and only if 


X 
im, f sinnt dt = 0 for allxeJ. 
n—>ooj 


Explicit calculation of these integrals shows that this is true. On the other hand, observe that 


for each n, 
qT qT 
| | sinne|? dt = | sin’ ntdt = 7. 
a |_ | 


Thus no subsequence of { f,,} converges strongly in L?(J) to f =0. A similar estimate shows 
no subsequence converges strongly in any L?(1). Therefore by the Bounded Convergence 
Theorem, no subsequence of { f,} converges pointwise almost everywhere on I to f =0. 


Example For a natural number n, define tn =~ X(0,1/n] On [0, 1]. Define f to be 


identically zero on [0, 1]. Then { f,} is a sequence of unit functions in L'[0, 1] that converges 
pointwise to f on [0, 1]. But {f,} does not converge weakly to f in L'[0, 1] since, taking 
§ = xj0,1)€ L™[0, 1], 


1 1 1 1 
lim | g-f,= tim ff, =1white [- p= f ¢a1 
n> oo 0 . n> oo 0 0 0 


Example Define the tent function fo on R to vanish outside (—1, 1), be linear on the 
intervals [—1, 0] and [0, 1] and take the value 1 at x = 0. For each natural number n, define 
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In(x) = fo(x —n) and let f =0 on R. Then {f,} > f pointwise on R. Let 1 < p< oo. The 
sequence {f,} is bounded in L?(R). We leave it as an exercise in the use of continuity of 
measure to show that for a set of finite measure A, 


‘lim [ f= [ f, (25) 


and thereby infer from Theorem 10 that, for 1 < p < 00, {f,} — f in L?(R). But { f,} does 
not converge weakly to f in L1(R) since for g=1 on R, g belongs to L®(R), while { te fn} 
does not converge to fp f. 


The preceding two examples exhibit bounded sequences in L'(E) that converge 
pointwise to a function in L'(E) and yet do not converge weakly in L'(E). This does not 
occur in L?( E) if 1 < p< oo. 


Theorem 12 Let E be a measurable set and 1 < p < oo. Suppose { fn} is a bounded sequence 
in LP( E) that converges pointwise a.e. on E to f. Then { fn} — f in L?(E). 


Proof We infer from Fatou’s Lemma, applied to the sequence {| f,|?}, that f belongs to 
L?(E). Theorem 11 tells us that to verify weak sequential convergence it is necessary and 
sufficient to show that for each measurable subset A of E of finite measure, 


lim [ fn= [ f (26) 


Let A be such a set. According to Corollary 2 of the preceding chapter, since the sequence 
{fn} is bounded in L?( E), it is uniformly integrable over E. But m(A) < oo. Therefore, by 


the Vitali Convergence Theorem, (26) holds. O 
The Radon-Riesz Theorem Let E be a measurable set and 1 < p < oo. Suppose { fn} — f in 
L?(E). Then 

(fn) > f in L?(E) if and only if lim Il fallp = lIf'lp 
Proof It is always that case, in any linear space X normed by || - ||, that strong convergence 


implies convergence of the norms. Indeed, this follows from the following consequence of 
the triangle inequality: 


llgll — HAI S lg — All for all g, A in X. 


It remains to show that in the L?( E) spaces, 1 < p < oo, weak convergence and convergence 
of the norms implies strong convergence in L?( E). We present the proof for the case p = 2.4 
Let { fn} be a sequence in L?( E) for which 


nj} fin L?(E)and lim | f?= | f?. 
(fn} — fin L?(E) an sim, | [ir 


4For the proof for general p > 1, a substitute is needed for the identity (a — b)* = a* — 2ab + b*. A detailed 
proof is provided in Frigyes Riesz and Béla Sz.-Nagy’s Functional Analysis [RSN90], pages 78-80. 
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Observe that for each n, 


Ife F18= fit 4P =f te r= ime-2fosr+ fie 


Since f belongs to L4(E) = L*(E), 


dim | te-t= [ P. 


Therefore { f,}—> f in L*(E). L 


Corollary 13 Let E be a measurable set and 1 < p < oo. Suppose { fy} — f in L?(E). Then 
a subsequence of { fn} converges strongly in L?(E) to f if and only if 


lfllp = lim inf || frllp. 


Proof If || /||, = liminf || f,||p, then there is a subsequence { f,, } for which lim _, o9 || fn ilp= 
ll fllp- The Radon-Riesz Theorem tells us that { f,,} converges strongly to f in L?(E). Con- 
versely, if there is a subsequence {fn,} that converges strongly to f, then lim; -, g || fn lp = 
Il fllp. Thus lim inf || fnllp < ll fllp. The right-hand inequality in (17) is this inequality in the 
opposite direction. O 


As the following example shows, the Radon-Riesz Theorem does not extend to the 
case p= 1. 


Example For each natural number n, define f,(x) = 1+sin(nx) on J = [—7, 7]. It follows 
from the Riemann-Lebesgue Lemma that the sequence { f,} converges weakly in L'(1) to the 
function f =1. Since each f, is nonnegative, we therefore also have lim, -. 66 |l fall = || fll. 
Since {sin(nx )} does not converge strongly in L!( I), { f,} does not converge strongly in L1(). 


Remark For E a measurable set, 1 < p< oo, f €L?(E), and f* the conjugate function of 
f, define T € (L?(E))* by 


r(h) =f f*-hforallheL"(E). 


Rewrite (17) as 

T(f) =f lp and ||T ||, = 1. (27) 
In Section 2 of Chapter 14, we prove the Hahn-Banach Theorem and as a corollary of this 
theorem show that if X is any normed linear space and f belonging to X, there is a bounded 
linear operator T in X* for which T( f) = ||f\| and ||T\|, = 1. For the L?(E) spaces, the 
conjugate function is a concrete presentation of this abstract functional. 


PROBLEMS 


12. Show that the sequence defined in the first example of this section does not converge strongly 
to f=0in L?(0, 1] for all1 < p<ov. 
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13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


Fix real numbers a and £. For each natural number n, consider the step function f,, defined 
on J = [0, 1] by 


fn(x) = (1 —(-1)")ae/2 + (1 + (-1)*) 8/2 for k/2" <x <(k+1)/2", O<k <2" 1. 


For 1 < p < 00, show that { f,,} converges weakly in L?(/) to the constant function that takes 
the value (a + B)/2. For a#£, show that no subsequence of { f,} converges strongly in L?(/). 


Let h be acontinuous function defined on all of R that is periodic of period T and iM h=0.Let 
[a, b] be a closed, bounded interval and for each natural number n, define the function f, on 
[a, b] by f,(x) = A(nx). Define f=0 on [a, b]. Show that for 1 < p<oo, {f,} — fin L?[a, 5]. 
Let 1 < p<ooand fp belong to L?(R). For each natural number n, define f,(x) = fo(x—n) 
for all x. Define f =0 on R. Show that { f,} — f in L?(R). Is this true for p = 1? 


Let E be a measurable set, { f,} a sequence in L?( E) and f belong to L*( E). Suppose 


jim, | f-fim, [ R= P 


Show that { f,,} converges strongly to f in L?(E). 


Let E be a measurable set and 1 < p < oo. Suppose { f,} is a bounded sequence in L?(E) 
and f belongs to L?( E). Consider the following four properties: (i) { f,} converges pointwise 
to f almost everywhere on E, (ii) {f,} — f in L?(E), (ii) {Il frllp} converges to {|| frllp}, and 
(iv) {fn} > f in L?(E). If {f,} possesses two of these properties, does a subsequence possess 
all four properties? 


Let X be a normed linear space and { f,,} — f in X. Suppose {|| f, ||} is unbounded. Show that, 
by possibly taking a subsequence and relabeling, we may suppose || f, || > a, =n - 3” for all 
n. Then show that, by possibly taking a further subsequence and relabeling, we may suppose 
(I fnll/an} > a@ € [1, co]. Define g, = a,/||fnll - f, for each n. Show that {g,} converges 
weakly and ||g,|| = 7 - 3” for all n. 


For 1 < p< ©, let {Z,,} be a bounded sequence in ¢? and ¢ belong in ¢?. Show that {Z,,} — £ 
in £? if and only if it converges componentwise, that is, for each index k, 


Jim, ck = ¢* where 2, =(0,02,...) and =(0',2,...). 


Let 1 < py < p2 < 0, {f,} be a sequence in L’2[0, 1] and f belong to L’2[0, 1]. What is the 
relationship between { f,} — f in L?2[0, 1] and {f,} — f in L”[0, 1]? 


For 1 < p < © and each index n, let e, € £? have nth component 1 and other components 
vanish. Show that if p> 1, then {e,} converges weakly to 0 in @?, but no subsequence 
converges strongly to 0. Show that {e,} does not converge weakly in £'. 


State and prove the Radon-Riesz Theorem in £7. 


Let [a, b] be a closed, bounded interval. Suppose {f,} — f in Cla, b]. Show that {f,} 
converges pointwise on [a, b] to f. 


Let [a, b] be a closed, bounded interval. Suppose {f,} — f in L™[a, b]. Show that 


x Xx 
lim | Sn =| f for all x €[a, dD]. 
n> oo a a 
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25. Let X be anormed linear space. Suppose that for each f € X there is bounded linear functional 
T € X* for which T( f) = || f|| and ||T ||, =1. 


(i) Prove that if { f,} converges weakly in X to both f, and fy, then f; = fo. 
(ii) Prove that if { f,} —~ f in X, then || f|| < liminf || fyl]. 
26. (Uniform Boundedness Principle) Let E be a measurable set, 1 < p< 00, and q the conjugate 


of p. Suppose { f,} is a sequence in L?( E) such that for each g € L7(E), the sequence {f,, g- fn} 
is bounded. Show that {f,,} is bounded in L?(E). 


8.3 WEAK SEQUENTIAL COMPACTNESS 


As we observed in the beginning of the preceding section, for [a, b] a closed, bounded 
interval and 1 < p < ov, there are bounded sequences in L?[a, b] that fail to have any 
strongly convergent subsequences. However, for 1 < p < ov, there is the following seminal 
theorem regarding weak sequential convergence. 


Theorem 14 Let E be a measurable set and 1 < p < oo. Then every bounded sequence in 
L?(E) has a subsequence that converges weakly in L?( E) to a function in L?( E). 


Our proof of this weak sequential compactness result is based on the following 
theorem.” 


Helley’s Theorem Let X be a separable normed linear space and {T,,} a sequence in its dual 
space X* that is bounded, that is, there is an M > 0 for which 


[Tr( f)| < M- If] forall f in X and all n. (28) 


Then there is a subsequence {T,,} of {T,} and T in X* for which 
lim Tn, (f) = T(f) forall f in X. (29) 
—> 00 


Proof Let { nine be a countable subset of X that is dense in X. We infer from (28) that 
the sequence of real numbers {T,,( f; )} is bounded. Therefore, by the Bolzano-Weierstrass 
Theorem, there is a strictly increasing sequence of integers {s(1,”)} and a number a, for 
which 


tim Ts(1n)( fi) = a1. 
We again use (28) to conclude that the sequence of real numbers {7,1 ,)( f2)} is bounded, 


and so again by the Bolzano-Weierstrass Theorem, there is a subsequence {s(2,n)} of 
{s(1,)} and a number a2 for which 


km T5(2,n)(f2) = a for all j. 
n> 0o 


>This theorem was proved by Eduard Helley in 1912 for the special case X = C[a, b], normed by the maximum 
norm. In his 1932 book, Stefan Banach observed, providing a one-sentence proof, that the result holds for any 
separable normed linear space. 


172 Chapter8 = The L? Spaces: Duality and Weak Convergence 


We inductively continue this selection process to obtain a countable collection of strictly 
increasing sequences of natural numbers {{s( j,n ee and a sequence of real numbers {a} 
such that for each j, 
{s(j +1,n)} is a subsequence of {s(j,n)}, 
and 
him. Ts(jn)( Si) = 4). 


n> & 


For each index k, define n;, = s(k, k). Then for each j, {nx}?° ; 18 a subsequence of {s( j, k)} 
and hence 
im Tn, (fj) = a; for all j. 
> © 


Since {T,,} is bounded in X* and {T,, ( f)} is a Cauchy sequence for each f is a dense subset 
of X, {T,,(f)} is Cauchy for all f in X. The real numbers are complete. Therefore we may 
define 

T(f) = lim Tm (Ff) for all f € X. 


Since each T,, is linear, the limit functional T is linear. Since 
IT, (F)| < M - || fll for all k and all f eX, 
IT(F)| = lim [Tn (f)| <M - [fill for all f eX. 
Therefore T is bounded. O 


Proof of Theorem 14 Let q be the conjugate of p. Let { f,} be a bounded sequence in L?(E). 
Define X = L?( E). Let n be a natural number. Define the functional 7, on X by 


Tn (8) =| 5 -gfor gin X = L4(E). 


Proposition 2, with p and q interchanged and the observation that p is the conjugate of q, 
tells us that each T, is a bounded linear functional on X and ||T, lx = Il fnllp. Since {f,} 1s 
a bounded sequence in L?(E), {T,} is a bounded sequence in X*. Moreover, according to 
Theorem 11 of Chapter 6, since 1 < g < 00, X = L4(E) is separable. Therefore, by Helley’s 
Theorem, there is a subsequence {T,,,} and T € X* such that 


im Tn, (g) = T(g) for all gin X = L4(E). (30) 
— 00 


The Riesz Representation Theorem, with p and q interchanged, tells us that there is a 
function f in L?(E) for which 


T(g) =| f-g forall gin X = L’(E). 
E 
But (30) means that 


ox org 
lim [ty § [ir 8 for all g in L7(E), 


k— oo 


According to Proposition 6, { fn,} converges weakly to f in L?(E) O 
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As we see in the following example, for [a, b] a nondegenerate closed, bounded 
interval, a bounded sequence in L'[a, b] may fail to have a weakly convergent subsequence. 


Example For J = [0, 1] and a natural number n, define J, = [0, 1/n] and f, =n- xj,. 
Then { fn} is a bounded sequence in L![0, 1] since || fnl|1 = 1 for all n. We claim that { f,} 
fails to have a subsequence that converges weakly in L'[0, 1]. Indeed, suppose otherwise. 
Then there is a subsequence { f,,} that converges weakly in L'{0, 1] to f € L+[0, 1]. For each 
[c, d] C[0, 1], integration against y;,. q) is a bounded linear functional on L*{0, 1]. Thus 


[r= fim [tu 


d 
| f =0forall0<c<d<1. 


Therefore 


It follows from Lemma 13 of Chapter 5 that f = 0 almost everywhere on [0, 1]. Therefore 


1 1 
o=/ f= tim | fy =1, 
0 k— 00 0 


This contradiction shows that { f,} has no weakly convergent subsequence. 


Definition A subset K of anormed linear space X is said to be weakly sequentially compact 
in X provided every sequence { f,} in K has a subsequence that converges weakly to f € K. 


Theorem 15 Let E be a measurable set and 1 < p < oo. Then 
{feL?(E)| \fllp <1} is weakly sequentially compact in L?(E). 


Proof Let {f,} be a sequence in L?( E) for which || f,||p < 1 for all n. Theorem 14 tells us 
that there is a subsequence { f,, } which converges weakly to f € L?( E). Moreover, || f||p < 1 
since, by (17), 

fllp < liminf || fully <1. 0 


Remark While a general bounded sequence in L'(E), does not have a weakly convergent 
subsequence, a theorem of Dunford and Pettis, which we prove in Section 5 of Chapter 19, 
tells us that any bounded sequence in L'(E) that is uniformly integrable possesses a weakly 
convergent subsequence. 


PROBLEMS 


27. Let [a, b] be a nondegenerate closed, bounded interval. In the Banach space C[a, b], normed 
by the maximum norm, find a bounded sequence that fails to have any strongly convergent 
subsequence. 


28. For 1 < p < ov, find a bounded sequence in ¢? that fails to have any strongly convergent 
subsequence. 
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29. Let E be a measurable set that contains a nondegenerate interval. Show that there is a 
bounded sequence in L'(£) that fails to have a weakly convergent subsequence. Exhibit 
a measurable set E for which every bounded sequence in L!(E) has a weakly convergent 
subsequence. 


30. Let X be a normed linear space, {7,,} be a sequence in X*, and T belong to X*. Show that 
{T,} + T with respect to the || - ||, norm if and only if 


lim T(f) =T(f) uniformly on {f eX | [fl <1}. 


31. Is the sequence defined in the last example of this section uniformly integrable? 
32. For p = 1, at what point does the proof of Theorem 14 fail? 


33. Show that in £?,1 < p< oo, every bounded sequence in £? has a weakly convergent 
subsequence. 


34. Let { f,} be a sequence of functions on [0, 1], each of which is of bounded variation and for 
which {TV( f,,)} is bounded. Show that there is a subsequence { f,,} with the property that 
for each continuous function g on [0, 1], the sequence of integrals { I g(x) dfn, (x) dx} is 
Cauchy. 

35. Let X be a normed linear space and {T,,} a sequence in X* for which there is an M > 0 such 
that ||7,||. < M for all n. Let S be a dense subset of X such that {T,,(g)} is Cauchy for all 
ges. 

(i) Show that {7,,(g)} is Cauchy for all g€ X. 


(ii) Define T(g) = lim, -. « T,(g) for all g € X. Show that T is linear. Then show that T is 
bounded. 


36. Show that the conclusion of Helley’s Theorem is not true for X = L™[0, 1]. 


37. Let E have finite measure and 1 < p < oo. Suppose {f,} is a bounded sequence in L?(E) 
and f belongs to L?(£). If one of the following properties holds, determine, for each of the 
other properties, if a subsequence has that other property. The cases p = 1 and p > 1 should 
be considered. 

(i) {fn} > fim L?(E). 
(ii) {fn} > fin L?(E). 
(ili) {f.} > f pointwise a.e. on E. 


(iv) {fn} > f in measure. 


8.4 THE MINIMIZATION OF CONVEX FUNCTIONALS 


The L? spaces were introduced by Frigyes Riesz as part of a program to formulate for 
functionals and mappings defined on infinite dimensional spaces appropriate versions of 
properties possessed by functionals and mappings defined on finite dimensional spaces. The 
initial goal was to provide tools with which to analyze integral equations. This program 
was particularly successful for linear functionals and mappings and indeed the subject of 
linear algebra matured into the subject called linear functional analysis. However, beyond 
linear functionals, just as convex functions defined on convex sets of real numbers possess 
quite special properties, convex functionals defined on convex subsets of the L? spaces also 
possess special features. In this section we consider a minimization principle for such convex 
functionals. 
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Let E be a measurable set and 1 < p < oo. We have exhibited sequences in L?(E) 
that converge weakly but have no subsequences that converge strongly. In view of this, the 
following theorem is somewhat surprising. 


The Banach-Saks Theorem Let E be a measurable set and 1 < p < 00. Suppose { fn} — f 
in LP(E). Then there is a subsequence {f,,} for which the sequence of arithmetic means 
converges strongly to f in L?( E), that is, 


lim Sny + Say +--+ Sry 


k-> 0 k 


= f strongly in L?(E). 


Proof We present the proof for the case p = 2.° By replacing each f, with f, — f, we 
suppose f = 0. Theorem 7 tells us that { f,} is bounded in L*(E). Choose M > 0 for which 


[ 4 sMitoralln 
E 


Define nj = 1. Since { f,} converges weakly in L7(E) to 0 and fy, belongs to L?(E ) (here, 


of course, p = q = 2), we can choose a natural number nz > n1 for which | te fny* Snp 
Suppose we have chosen natural numbers n, < n> < ... <n, such that 


<1. 


[cto hy? cay Mitr peta 
E 


Since fn, +... + fa, belongs to L?(E) and { f,} converges weakly in L2(E) to 0, we may 
choose a natural ny.) > n, for which 7 


J Suu Sou) Irus $1 — (31) 
However, 
[tatoo fas)? = [too J? 
+2 f Um +. + fig) * Fagg + +fia. Fat’ 
and therefore _ 


| (fay +. +++ Situ)” S 2k + Mk +24 M =2(k +1) 4+ M(k +1). 
E | | 
The subsequence { f,,} has been inductively chosen so that - 


i fa fa +..t+ fn). (2+) 
E 


, < , for all k, 


Therefore the sequence of arithmetic means of { f,,} converges strongly to f=0in L?(E£).0 


®For the proof for p #2, see the original paper by S. Banach and S. Saks in Studia Math., vol 2, 1930. 
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Definition A subset C of a linear space X is said to be convex provided whenever f and g 
belong to C and d ¢€ [0, 1], then Af + (1 — A)g also belongs to C. 


Definition A subset C of a normed linear space X is said to be closed provided whenever { fn} 
is a Sequence in X that converges strongly in X to f, then if each f,, belongs to C, the limit f 
also belongs to C. 


Example Let E be a measurable set, 1 < p < oo, and g nonnegative function in L?(E), 
define 
C = {f measurable on E | |f| < ga.e.on E}. 


We claim that C is a closed, convex subset of L?(E). Indeed, we infer from the integral 
comparison test that each function in C belongs to L?(E). It is clear that C is convex. To 
verify that C is closed, let {f,,} be a sequence in L?(E) that converges in L?(E) to f. By 
the Riesz-Fischer Theorem, there is a subsequence of { f,} that converges pointwise almost 
everywhere on E to f. From this pointwise convergence it follows that f belongs to C. 


Example Let E be a measurable set and 1 < p< oo. Then B= {fe L?(E)||lfllp < 1} is 
closed and convex. To see it is convex just observe that if f and g belong to B and A € [0, 1], 
then, by the Minkowski Inequality, 


AF + (1-A)gilp SAlfllp + (1-A)llgllp <1. 


To see that B is closed observe that if { f,} is a sequence in B that converges in L?(E) to 
f ¢L?(E), then it follows from Minkowski’s Inequality that for each n, ||| full p—Ifllpl < 
Il fn — fllp, so that {|| fullp} converges to || fl|p. Thus || fllp < 1. 


Definition A real-valued functional T defined on a subset C of a normed linear space X is 
said to be continuous provided whenever a sequence { f,} in C converges strongly to f €C, 


then {T( fn)) > T(f). 


In the very special case of a linear functional, continuity is equivalent to boundedness. 
In general, these concepts are unrelated. 


Definition A real-valued functional T defined on a convex subset C of a normed linear space 
X is said to be convex provided whenever f and g belong to C and d € [0, 1], 


T(Af +(1—A)g) < AT(f)+(1-A)T(g). 


In any normed linear space, the triangle inequality is equivalent to the convexity of the 
norm. 


Example Let E be a measurable set and 1 < p < oo. Suppose g is a continuous, convex real- 
valued function defined on R for which there are constants a and b such that |p(s)| < a+|s|? 
for all real numbers s. Define the functional T on L?(E) by 


r(f)= | eo f forall fe1°(E). 
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We leave it as an exercise (see Problem 42) to show that T is properly defined, continuous, 
and convex. 


Lemma 16 Let E be a measurable set and 1 < p < oo. Suppose C is a closed, bounded convex 
subset of L?(E) and T is a continuous convex functional on C. If { f,} is a sequence in C that 
converges weakly in L?( E) to f, then f also belongs to C. Moreover, 


T( f) < liminf T(f, ). 


Proof By the Banach-Saks Theorem, there is a subsequence of {f,} whose sequence of 
arithmetic means converges strongly in L?( E) to f. The arithmetic means belong to C since 
C is convex and therefore, since C is closed, the function f belongs to C. Moreover, there 
is a further subsequence of {T( f,, )} that converges to a = liminf T( f,). Therefore, we may 
choose a subsequence such that 


lim Sny + Say He + Sy 


k—> oo k 


= f strongly in L?(E) 
and 
lim T(fn,) =a. 
k> © 
Since the functional T is continuous, 


r(f)= jim, 7 (S14 oF In) 


k 


Moreover, the arithmetic means of a convergent sequence of real numbers converge to the 
same limit and therefore 


lim T( fay) +T( Sng) + °° + T( Sry) _ 


k— oo k 


On the other hand, since T is convex, for each k, 


k 


~ k 
Thus 


T(f)= jim, 7 (S1* feb fn) 


< lim T( for) +T( fag) ++ +T( Sry) _ 


~ k-> 00 k 


O 


Theorem 17 Let E be a measurable set and 1 < p < oo. Suppose C is a closed, bounded 
convex subset of L?(E) and T is a continuous convex functional on C. Then T takes on a 
minimum value on C, that is, there is a function fy € C such that 


T( fo) < T(f) forall f Ec. 
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Proof We first show that the image T(C) is bounded below. Indeed, otherwise there is a 
sequence { f,,} in C such that lim, -, 0 T( fn) = —oo. Since C is bounded, by possibly taking 
a subsequence, we use Theorem 14 to suppose that {f,} converges weakly in L?(E) toa 
function f in L?(E). We infer from the preceding lemma that f belongs to C and that 


T( f) < liminf T( f,) = —oo. 
This is a contradiction. Thus 7 is bounded below on C. Define 
c=inf {T(f)| fect}. 


Choose a sequence {f,,} in C such that lim, -, . T(f,) = c. Again, by possibly taking a 
subsequence we may invoke Theorem 14 to suppose that { f,,} converges weakly in L?( EF) to 
a function fp in L?(E). We infer from the preceding lemma that fy belongs to C and 


T( fo) < liminf T( f,) =. 
Thus T( fo) =. CJ 


Corollary 18 Let E be a measurable set of finite measure and 1 < p < oo. Suppose ¢ is a 
real-valued continuous convex function on R for which there are constants c,; > 0 and cp > 0 
such that 


lp(s)| < ¢1 +c: |s|? for all s. (32) 
Then there is a function fy € L?( E) with || follp < 1 for which 
[eon= min [oor (33) 
E fELP(E) Nflpst JE 


Proof If f is a measurable real-valued function on E, since ¢ is continuous, the composition 
yo f is measurable. Let f belong to L?( E). Since f is finite a.e. on E, we infer from (32) that 


lpo fl<cy+o-|f|? ae. on E. 


Thus, by the integral comparison test, g o f is integrable over E. Define the functional T on 
L?(E) by 


r(f)= | eo f forall fe1"(B), 


Then T is properly defined and it inherits convexity from ~. We already noted that the 
set C = {f €L?(E)||lfllp < 1} is strongly closed, bounded, and convex. The existence of 
a minimizer for T on C will be a consequence of the preceding theorem if we show that 
T is continuous in L?(E). Let {f,,} be a sequence in L?(E) that converges strongly to f 
in L?(E). By taking a subsequence if necessary and relabeling, we suppose { f,} is rapidly 
Cauchy. Therefore, according to Theorem 6 of Chapter 7, { f,} converges pointwise a.e. on 
E to f. Since ¢ is continuous, {g o f,} converges pointwise a.e. on E to go f. Moreover, by 
the completeness of L4( E), since {f,} is rapidly Cauchy in L?(E), the function 


Lo, @) 
g= Dif 
k=l 


aR SE SE A eT TERETE Bde RAB PERE a ES et aE er 
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belongs to L?(E). It is clear that 
[fn] < g ae. on E for all n, 


and hence, by the inequality (33), 
lpo fal <c1 +02: |fal? < cy +02 -g? ae. on E for all n. 


We infer from the Dominated Convergence Theorem that 


lim voh=| oof 
E E 


n> © 


Therefore T is continuous on L?(E). CL] 


Remark We proved the Bolzano-Weierstrass Theorem in Chapter 1: every bounded sequence 
of real numbers has a convergent subsequence. This theorem is the basis of the argument 
that every continuous real-valued function on a closed, bounded interval takes a minimum 
value. In the mid nineteenth century it was uncritically assumed that a similar argument was 
valid for establishing minimum values of real-valued functionals on spaces of functions. Karl 
Weierstrass observed the fallacy in this argument. Given a sequence of continuous functions 
{fn} on [0, 1] for which 


1 
[ Ifal? <1 for all n, 
0 


there may not exist a subsequence { fn,} and f € L*{0, 1] for which 


1 
im lfng — FP =O: 
k>oo Jo 


(see Problem 45). Many mathematicians, including David Hilbert, turned their attention to 
investigating specific classes of functionals for which it is possible to prove the existence of 
minimizers.’ Theorem 17 exhibits one such class of functionals. 


PROBLEMS 


38. For 1 < p< oo and each index n, let e, € £? have nth component 1 and other components 0. 
Show that {e,} converges weakly to 0 in £?, but no subsequence converges strongly to 0. Find 
a subsequence whose arithmetic means converge strongly to 0 in 2”. 


39. Show that if a sequence of real numbers {a,} converges to a, then the sequence of arithmetic 
means also converges to a. 


40. State and prove the Banach-Saks Theorem in @?. 
41. Let E be a measurable set and 1 < p< oo. Let T be a continuous linear functional on L?[a, 5] 
and K = {feL?(E)|\|fllp < 1}. Find a function fo € K for which 


T( fo) => T(f) for all f in K. 


THilbert’s article On the Dirichlet Principle is translated in A Source Book in Classical Analysis by Garrett 
Birkhoff (Harvard University Press, 1973). 
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42. (Nemytskii) Let E be a measurable set and pj, p2 belong to [1, 00). Suppose ¢ is a 
continuous real-valued function defined on R for which there are constants c, and c such that 
lp(s)| <cy + co|s|?1/P2 for all real numbers s. Let {fn} be a sequence in L?!( E). Show that 


if {fn}—> f in L?!(E), then {yo fy} > go fin L™. 


43. (Beppo Levi) Let E be a measurable set, 1 < p < 00, and C a closed bounded convex subset 
of L?( E). Show that for any function fp € L?(E), there is a function gp in C for which 


lgo — follp < lg — follp for all gin C. 
44. (Banach-Saks) For a natural number n, define the function f,, on [0, 1] by setting 
fa(x) = 1 for k/2” +1/27"*1 <x <(k+1)/2" and0 <k <2" -1, 
and f,(x) = 1 —2"+! elsewhere on [0, 1]. Define f =0 on [0, 1]. 
(i) Show that 
[i 
0 


im, | i= | f for all x € [0, 1]. 
n—>CO 0 0 


< 1/2" for all x € [0, 1] and all n, 


and therefore 


(ii) Define E to be the subset of [0, 1] on which f, = 1 for all n. Show that 
| fn =m(E) > 0 for all n. 
E 


(iii) Show that || fll, < 2 for all n. Infer from part (ii) that {f,} is a bounded sequence 
in L1[0, 1] that does not converge weakly in L'[0, 1] to f. Does this and part (i) 
contradict Theorem 11. 

(iv) For 1 < p< oo, infer from part (ii) that { f,} is a sequence in L?[0, 1] that does not 

: converge weakly in L?[0, 1] to f. Does this and part (i) contradict Theorem 11? 


45. Find a sequence {gn} in L[0, 1] that has no Cauchy subsequence. Use this subsequence 
and the denseness of the continuous functions in L?[0, 1] to find a sequence of continuous 
functions on [0, 1] for which no subsequence converges in L7[0, 1] to a function in L7[0, 1]. 
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In Chapter 1 we established three types of properties of the real numbers. The first type are 
the algebraic properties related to addition and multiplication. The second are the properties 
of the positive numbers by way of which the concepts of order and absolute value are defined. 
Using the algebraic and order properties, the distance between two real numbers is defined 
to be the absolute value of their difference. The final property possessed by the real numbers 
is completeness: the Completeness Axiom for the real numbers is equivalent to the property 
that every Cauchy sequence of real numbers converges to a real number. In the study of 
normed linear spaces, which was begun in Chapter 7, the algebraic structure of the real 
numbers is extended to that of a linear space; the absolute value is extended to the concept of 
a norm, which induces a concept of distance between points; and the order properties of the 
real numbers are left aside. We now proceed one step further in generalization. The object 
of the present chapter is to study general spaces called metric spaces for which the notion 
of distance between two points is fundamental. There is no linear structure. The concepts of 
open set and closed set in Euclidean space extend naturally to general metric spaces, as do 
the concepts of convergence of a sequence and continuity of a function or mapping. We first 
consider these general concepts. We then study metric spaces which possess finer structure: 
those that are complete, compact, or separable. 


9.1 EXAMPLES OF METRIC SPACES 


Definition Let X be a nonempty set. A function p: X X X > R is called a metric provided for 
all x, y, and z in X, 
(i) p(x, y) = 0; 
(ti) p(x, y) = Oif and only if x = y; 
(iii) p(x, y) = p(y, x); 
(iv) p(x, y) < p(x, z) + p(z, y). 


A nonempty set together with a metric on the set is called a metric space. 
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We often denote a metric space by (X, p). Property (iv) is known as the triangle 
inequality for the metric. The quintessential example of a metric space is the set R of all real 
numbers with p(x, y) = |x — yl. 


Normed Linear Spaces In Section 7.1 we extended the concept of absolute value to a 
general linear space. Recall that a nonnegative real-valued function || - || on a linear space X 
is called a norm provided for each u, v€ X and real number a, 


(i) ||u|| = Oif and only if u = 0. 
(ii) |lu+ vl] < lull + loll. 
(iii) lau] = |alllul). 


We called a linear space with a norm a normed linear space. A norm || - || on a linear space 
X induces a metric p on X by defining 


p(x, y) = ||x — y|| for all x, ye X. (1) 


Property (ii) of a norm is called the triangle inequality for the norm. It is equivalent to the 
triangle inequality for the induced metric. Indeed, for x, y,z€X, setu=x—zandv=z-—y 
and observe that 


lu + ull < [ul] + [lvl] if and only if p(x, y) < p(x, z) + p(z, y). 


Three prominent examples of normed linear spaces are Euclidean spaces R", the L?(E) 
spaces, and C[a, b]. For a natural number n, consider the linear space R” whose points are 
n-tuples of real numbers. For x = (x1, ..., X,) in R" the Euclidean norm of x, ||x||, is defined 
by 

xl] = [x2 +. HZ PY? 


We devoted Chapters 7 and 8 to the study of the normed linear spaces L?(E),for 1 < p < 00 
and E a Lebesgue measurable set of real numbers. For 1 < p < oo, the triangle inequality 
for the L?(E) norm is called the Minkowski Inequality. For a closed bounded interval of 
real numbers [a, b], consider the linear space C[a, b] of continuous real-valued functions on 
[a, b]. The maximum norm || - ||max is defined for f € C[a, b] by 


Il flmax = max {| f(x)| | xe[a, b]}. 


The triangle inequality for the maximum norm follows from this inequality for the absolute 
values on the real numbers. : 


The Discrete Metric For any nonempty set X, the discrete metric p is defined by 
setting p(x, y) = Oifx = yand p(x, y) =1lifx#y. 


Metric Subspaces_ For a metric space (X, p), let Y be a nonempty subset of X. Then 
the restriction of p to Y x Y defines a metric on Y and we call such a metric space a 
metric subspace. Therefore every nonempty subset of Euclidean space, of an L?( E) space, 
1 < p < ~, and of C[a, b] is a metric space. 
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Metric Products For metric spaces (X1, p1) and (Xz, p2), we define the product 
metric 7 on the Cartesian product X, X X2 by setting, for (x1, x2) and (yj, y2) in X X Xp, 


2( (21.32). (91.92) = {loves DP + lool, PP 


It is readily verified that 7 has all the properties required of a metric. This construction 
extends to countable products (see Problem 10). | 

A particularly interesting and useful example of a metric space that is not directly 
presented as a metric subspace of a normed linear space is the Nikodym metric on the 
collection of measurable subsets of a Lebesgue measurable subset of R (see Problem 5). 

On any nonempty set X consisting of more than one point there are different metrics. 
For instance, if X is a nonempty collection of continuous functions on the closed, bounded 
interval [a, b], then X is a metric space with respect to the discrete metric, with respect to 
the metric induced by the maximum norm and, for 1 < p < ©, with respect to the metric 
induced by the L?[a, b] norm. The following relation of equivalence between metrics on a 
set is useful. | | 


Definition Two metrics p and o ona set X are said to be equivalent provided there are positive. 
numbers c, and cz such that for all x1, x. € X, 


cy -0(x1,%2) < p(x, x2) <c2-o0(x1, x2). 


Definition A mapping f from a metric space (X, p) to a metric space (Y, o) is said to be an 
isometry provided it maps X onto Y and for all x1, x» € X, 


o( f(x1), f(x2)) = p(x1, x2). 


Two metric spaces are called isometric provided there is an isometry from one onto the 
other. To be isometric is an equivalence relation among metric spaces. From the viewpoint | 
of metric spaces, two isometric metric spaces are exactly the same, an isometry amounting 
merely to a relabeling of the points. 

In the definition of a metric p on a set X it is sometimes convenient to relax the 
condition that p(x, y) = 0 only if x = y. When we allow the possibility that p(x, y) = 0 for 
some x # y, we call p a pseudometric and (X, p) a pseudometric space. On such a space, 
define the relation x = y provided p(x, y) = 0. This is an equivalence relation that separates 
X into a disjoint collection of equivalence classes X/ = . For equivalence classes [x] and [y], 
define p ([x], [y]) = p(x, y). It is easily seen that this properly defines a metric pon X/=., 
Similar considerations apply when we allow the possibility, in the definition of a norm, that 
l|u|| = 0 for «#0. Some examples of pseudometrics and pseudonorms are considered in 
Problems 5, 7, 9, and 49. 


PROBLEMS 


1. Show that two metrics p and 7 on the same set X are equivalent if and only if there is ac > 0 
such that for all u, ve X, 


r(u, v) Sp(u, v) <cr(u, v). 
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2. Show that the following define equivalent metrics on R": 


3. 


4. 


p*(x, y) =I — yl +... + btn — yal 
pt (x, y) = max{|x1 ~ yl, seey [Xn ~ ynl}. 
Find a metric on R” that fails to be equivalent to either of those defined in the preceding 


problem. 


For a closed, bounded interval [a, b], consider the set X = C[a, b] of continuous real-valued 
functions on [a, b]. Show that the metric induced by the maximum norm and that induced by 
the L'[a, b] norm are not equivalent. 


. The Nikodym Metric. Let E be a Lebesgue measurable set of real numbers of finite measure, 


X the set of Lebesgue measurable subsets of E, and m Lebesgue measure. For A, Be X, 
define p( A, B) = m(AAB), where AAB = [A ~ B] U[B~ A], the symmetric difference of A 
and B. Show that this is a pseudometric on X. Define two measurable sets to be equivalent 
provided their symmetric difference has measure zero. Show that p induces a metric on the 
collection of equivalence classes. Finally, show that for A, Be X, 


p(A, B) =| Iva — Xal, 
E 


where x4 and yz are the characteristic functions of A and B, respectively. 


. Show that for a, b,c > 0, 


a b Cc 
<< 


ifa<b-+c, th 
wasore "Tra 14b 1te 


. Let E be a Lebesgue measurable set of real numbers that has finite measure and X the set of 


Lebesgue measurable real-valued functions on E. For f, g€ X, define 


_ lf —sl 
pf, )-/| 7. 


Use the preceding problem to show that this is a pseudometric on X. Define two measurable 
functions to be equivalent provided they are equal a.e.on E. Show that p induces a metric on 
the collection of equivalence classes. 


. For 0 < p <1, show that 


(a+b)? <a? +D? for alla, b> 0. 


. For E a Lebesgue measurable set of real numbers, 0< p <1, and g and h Lebesgue measurable 


functions on E that have integrable pth powers, define 


pp(h, 8) =| lg —Al?. 
E 


Use the preceding problem to show that this is a pseudometric on the collection of Lebesgue 
measurable functions on E that have integrable pth powers. Define two such functions to 
be equivalent provided they are equal a.e.on E. Show that p,(-, -) induces a metric on the 
collection of equivalence classes. 


latitude 


Rg 
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10. Let {(Xn, pn) r.,be a countable collection of metric spaces. Use Problem 6 to show that p, 
defines a metric on the Cartesian product Ire, Xn, where for points x = {x,} and y = {y,} in 
Tr X ns 


“1 Pn(Xn, yn ) 
pax y) = Si Palins Yn) 
. 2 7 1+ pn(xn, Yn ) 


11. Let (X, p) be a metric space and A any set for which there is a one-to-one mapping f of A 
onto the set X. Show that there is a unique metric on A for which f is an isometry of metric 
spaces. (This is the sense in which an isometry amounts merely to a relabeling of the points 
in a space.) 


12. Show that the triangle inequality for Euclidean space R” follows from the triangle inequality 
for L[0, 1]. 


9.2 OPEN SETS, CLOSED SETS, AND CONVERGENT SEQUENCES 
Many concepts studied in Euclidean spaces and general normed linear spaces can be naturally 
and usefully extended to general metric spaces. They do not depend on linear structure. 
Definition Let (X, p) be a metric space. For a point x in X and r > 0, the set 

B(x, r)={x'eX| p(x’, x) <r} 


is called the open ball centered at x of radius r. A subset O of X is said to be open provided for 
every point x €Q, there is an open ball centered at x that is contained in O. For a point x € X, 
an open Set that contains x is called a neighborhood of x. 


We should check that we are consistent here, namely, that an open ball is open. By the 
definition of open set, to show that B(x, r) is open it suffices to show that 


if x’ € B(x, r) andr’ =r — p(x’, x), then B(x’, r’) C B(x, r). 
To verify this, let ye B(x’, r’). Then p(y, x’) <7’, so that, by the triangle inequality, 


ply, x) < p(y, x) + p(x, x) <r’ +p(x', x) =r. 
Therefore B(x’, r’) C B(x, r). 


Proposition 1 Let X be a metric space. The whole set X and the empty-set % are open; the 
intersection of any two open subsets of X is open; and the union of any collection of open 
subsets of X is open. 


Proof It is clear that X and § are open and the union of a collection of open sets is open. 
Let O; and ©? be open subsets of X. If these two sets are disjoint, then the intersection 
is the empty-set, which is open. Otherwise, let x belong to O; 1 O>. Since O, and O> are 
open sets containing x, there are positive numbers 5, and 5) for which B(x, 5, ) C ©, and 
B(x, 62) C Op. Define 6 = min{5,, 52}. Then the open ball B(x, 6) is contained 0; N O>. 
Therefore O; N O> is open. UI 
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The following proposition, whose proof we leave as an exercise, provides a description, 
in the case the metric space X is a subspace of the metric space Y, of the open subsets of X 
in terms of the open subsets of Y. 


Proposition 2 Let X be a subspace of the metric space Y and E a subset of X. Then E is open 
in X if and only if E = X 10, where O is open in Y. 


Definition For a subset E of a metric space X, a point x € X is called a point of closure of E 
provided every neighborhood of x contains a point in E. The collection of points of closure of 
E is called the closure of E and is denoted by E. 


It is clear that we always have E C E. If E contains all of its points of closure, that is, 
E = E, then the set E is said to be closed. For a point x in the metric space (X, p) andr >0, 
the set B(x, r)={x' € X| p(x’, x) <r} is called the closed ball centered at x of radius r. It 
follows from the triangle inequality for the metric that B(x, r) is a closed set that contains 
B(x, r). In a normed linear space X we refer to B(0, 1) as the open unit ball and B(0, 1) as 
the closed unit ball. 


Proposition 3 For E a subset of a metric space X, its closure E is closed. Moreover, Eis the 
smallest closed subset of X containing E in the sense that if F is closed and E C F, thenE C F. 


Proof The set F is closed if it contains all its points of closure. Let x be a point of closure 
of E. Consider a neighborhood U, of x. There is a point x’ € EN U,. Since x’ is a point of 
closure of E and U, is a neighborhood of x’, there is a point x” € EN U,. Therefore every 
neighborhood of x contains a point of E and hence x € E. So the set E is closed. It is clear 
that if A C B, then A C B, and hence if F is closed and contains E, then E C F = F. O 


Proposition 4 A subset of a metric space X is open if and only if its complement in X is 
closed. 


Proof First suppose E is open in X. Let x be a point of closure of X ~ E. Then x cannot 
belong to E because otherwise there would be a neighborhood of x that is contained in 
E and thus disjoint from X ~ E. Thus x belongs to X ~ E and hence X ~ E is closed. Now 
suppose X ~ E is closed. Let x belong to E. Then there must be a neighborhood of x that is 
contained in E, for otherwise every neighborhood of x contains points in X ~ E and therefore 
x is a point of closure of X ~ E. Since X ~~ E is closed, x also belongs to X ~ E. This is a 
contradiction. LI 


Since X ~ [X ~ E] = E, it follows from the preceding proposition that a set is closed if 
and only if its complement is open. Therefore, by De Morgan’s Identities, Proposition 1 may 
be reformulated in terms of closed sets as follows. 


Proposition 5 Let X be a metric space. The empty-set ® and the whole set X are closed; the 
union of any two closed subsets of X is closed; and the intersection of any collection pf closed 
subsets of X is closed. 
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We have defined what it means for a sequence in a normed linear space to converge. 
The following is the natural generalization of convergence to metric spaces. 


Definition A sequence {x,} in a metric space (X, p) is said to converge to the point x € X 
provided 
lim p(x, x) =0, 
n> OO 


that is, for each € > 0, there is an index N such that for everyn > N, p(Xn, x) <€. The point 
to which the sequence converges is called the limit of the sequence and we often write {xn} > x 
to denote the convergence of {xn} to x. 


A sequence in a metric space can converge to at most one point. Indeed, given two 
points u, v in a metric space X, set r = p(u, v)/2. We infer from the triangle inequality for 
the metric p that B(u, r) and B(v, r) are disjoint. So it is not possible for a sequence to 
converge to both u and v. Moreover, convergence can be rephrased as follows: a sequence 
{x,} converges to the limit x provided that for any neighborhood O of X, all but at most 
finitely many terms of the sequence belong to ©. Naturally, for a subset E of X and a 
sequence {x,} such that x, belongs to E for all n, we say that {x,} is a sequence in E. 


Proposition 6 For a subset E of a metric space X, a point x € X is a point of closure of E if 
and only if x is the limit of a sequence in E. Therefore, E is closed if and only if whenever a 
sequence in E converges to a limit x € X, the limit x belongs to E. 


Proof It suffices to prove the first assertion. First suppose x belongs to E. For each natural 
number n, since B(x, 1/n) M E#@, we may choose a point, which we label x, that belongs 
to B(x,1/n) NM E. Then {x,} is a sequence in E and we claim that it converges to x. Indeed, 
let € > 0. Choose an index N for which 1/N < ¢. Then 


pP(xXn, x) <1f/n<1/N <eifn=N. 


Thus {x,} converges to x. Conversely, if a sequence in E converges to x, then every ball 
centered at x contains infinitely many terms of the sequence and therefore contains points 
in E.SoxeE. L 


In general, a change in the metric on a set will change what it means for a subset to be 
open and therefore what it means for a subset to be closed. It will also change what it means 
for a sequence to converge. For instance, for the discrete metric on a set X, every subset is 
open, every subset is closed, and a sequence converges to a limit if and only if all but a finite 
number of terms of the sequence are equal to the limit. The following proposition, the proof 
of which we leave as an exercise, tells us that for equivalent metrics on a set, the open sets 
are the same, and therefore the closed sets are the same and convergence of a sequence is 
the same. 


Proposition 7 Let p and o be equivalent metrics on a nonempty set X. Then a subset of X is 
open in the metric space (X, p) if and only if it is open in the metric space (X, a). 
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13. 


14. 
15. 


16. 
17. 
18. 


19. 


20. 


21. 


22. 


23. 


24. 


PROBLEMS 


In a metric space X, is it possible for r > 0 and two distinct points u and v in X to have 
B(u, r) = B(v, r)? Is this possible in Euclidean space R"? Is it possible in a normed linear 
space? 


Let (X, p) be a metric space in which {u,,} — u and {v, } > v. Show that {o(un, vn )}—> p(u, v). 


Let X be a metric space, x belong to X andr > 0. 
(i) Show that B(x, r) is closed and contains B(x, r). 


(ii) Show that in a normed linear space X the closed ball B(x, r) is the closure of the open 
ball B(x, r), but this is not so in a general metric space. 


Prove Proposition 2. 
Prove Proposition 7. 


Let X be a subspace of the metric space Y and A a subset of X. Show that A is closed in X if 
and only if A = XN F, where F 1s closed in Y. 


Let X be a subspace of the metric space Y. 


(i) If O is an open subset of the metric subspace X, is O an open subset of Y? What if X is 
an open subset of Y? 


(ii) If F is aclosed subset of the metric subspace X, is F a closed subset of Y? What if X is a 
closed subset of Y? 


For a subset E of a metric space X, a point x € X is called an interior point of E provided 
there is an open ball centered at x that is contained in E: the collection of interior points of E 
is called the interior of E and denoted by int E. Show that int E is always open and E is open 
if and only if E = int E. 


For a subset E of a metric space X, a point x X is called an exterior point of E provided 
there is an open ball centered at x that is contained in X ~ E: the collection of exterior points 
of E is called the exterior of E and denoted by ext E. Show that ext E is always open. Show 
that E is closed if and only if X ~ E = ext E. 


For a subset E of a metric space X, a point x € X is called a boundary point of E provided 
every open ball centered at x contains points in E and points in X ~ E: the collection of 
boundary points of E is called the boundary of E and denoted by bd E. Show (1) that bd E is 
always closed, (ii) that E is open if and only if EN bd E = @, and (iii) that E is closed if and 
only if bd E C E. 


Let A and B be subsets of a metric space X. Show that if A C B, then A CB. Also, show that 
(AUB) = AUB and (ANB) CANB. 


Show that for a subset E of a metric space X, the closure of E is the intersection of all closed 
subsets of X that contain E. 


9.3 CONTINUOUS MAPPINGS BETWEEN METRIC SPACES 


The following is the natural generalization of continuity for real-valued functions of a real 
variable. 


Definition A mapping f from a metric space X to a metric space Y is said to be continuous 
at the point x € X provided for any sequence {x,} in X, 


if {xn} —> x, then {f(xn)} > f(x). 
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The mapping f is said to be continuous provided it is continuous at every point in X. 


The following three propositions are generalizations of corresponding results for real- 
valued functions of a real variable and the proofs of the general results are essentially the 
same as the special cases. 


The €—6 Criterion for Continuity A mapping f from a metric space (X, p) to a metric space 
(Y, o) is continuous at the point x € X if and only if for every € > 0, there is a 5 > 0 for which 
if p(x, x’) <6, then o( f(x), f(x')) <é«, that is, 


f (B(x, 8)) C B( f(x), €). 


Proof First suppose f: X—>Y is continuous at x. We establish the €-—6 criterion by ar- 
guing by contradiction. Suppose there is some ¢€9 > 0 for which there is no positive 
number 6 for which f( B(x, 5)) C B( f(x), €9). In particular, if n is a natural num- 
ber, it is not true that f( B(x, 1/n)) C B( f(x), eo). This means that there is a point 
in X, which we label x,, such that p(x, x,) <1/n while o( f(x), f(%,)) = €0. This de- 
fines a sequence {x,} in X that converges to x, but whose image sequence { f(x, )} does 
not converge to f(x). This contradicts the continuity of the mapping f: X—> Y at the 
point x. 


To prove the converse, suppose the ¢-6 criterion holds. Let {x,} be a sequence in 
X that converges to x. We must show that {f(x,)} converges to f(x). Lete > 0. We 
can choose a positive number 6 for which f( B(x, 5)) C B( f(x), €). Moreover, since the 
sequence {x,} converges to x, we can select an index N such that x, € B(x, 6) forn > N. 
Hence f(x,) €B( f(x), €) for n > N. Thus the sequence { f(x, )} converges to f(x) and 
therefore f: X—>Y is continuous at the point x. U 


Proposition 8 A mapping f froma metric space X to a metric space Y is continuous if and 
only if for each open subset O of Y, the inverse image under f of O, f~'(O), is an open subset 
of X. 


Proof First assume the mapping f is continuous. Let O be an open subset of Y. Let 
x be a point in f~'(O); we must show that an open ball centered at x is contained 
in f-'(O). But f(x) is a point in O, which is open in Y, so there is some positive 
number r for which B(( f(x), r)) C O. Since f: XY is continuous at the point x, by 
the «—6 criterion for continuity at a point, we can select a positive number 6 for which 
f( B(x, 6)) C B( f(x), r) CO. Thus B(x, 5) C f-'(O) and therefore f~!(©) is open in X. 


To prove the converse, suppose the inverse image under f of each open set is open. Let 
x be a point in X. To show that f is continuous at x, we use the €—6 criterion for continuity. 
Let « > 0. The open ball B( f(x), €) is an open subset of Y. Thus f~!(B( f(x), €)) is open 
in X. Therefore we can choose a positive number 5 with B(x, 5) C f~'(B( f(x), €)), that 
is, f( B(x, 6)) C B( f(x), €). U 


Proposition9 The composition of continuous mappings between metric spaces, when defined, 
is continuous. 
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Proof Let f: XY be continuous and g: Y—> Z be continuous, where X,Y, and Z are 
metric spaces. We use the preceding proposition. Let O be open in Z. Since g is continuous, 
g !(O) is open in Y and therefore, since f is continuous, £7 (g71(O)) = (go f)7!(O) is 
open in X. Therefore go f is continuous. LI 


Definition A mapping from a metric space (X, p) to a metric space (Y, o) is said to be 
uniformly continuous, provided for every € > 0, there is a 5 > 0 such that for u, ve X, 


if p(u, v) <6, thena(f(u), f(v)) <e. 


We infer from the ¢-—6 criterion for continuity at a point that a uniformly continuous 
mapping is continuous. The converse is not true. 


Example A mapping f from a metric space (X, p) to a metric space (Y, o) is said to be 
Lipschitz provided there is a c > 0 such that for all u, ve X, 
o( f(u), f(v)) <c-p(u, v). 


A Lipschitz mapping is uniformly continuous since, regarding the criterion for uniform 
continuity, 5 = €/c responds to any € > 0 challenge. 


PROBLEMS 


25. Exhibit a continuous mapping that is not uniformly continuous and a uniformly continuous 
mapping that is not Lipschitz. 


26. Show that every mapping from a metric space (X, p) to a metric space (Y, o) is continuous 
if p is the discrete metric. 


27. Suppose there is a continuous, one-to-one mapping from a metric space (X, p) to a metric 
space (Y, o), where a is the discrete metric. Show that every subset of X is open. 


28. For a metric space (X, p), show that the metric p: X X X > R is continuous, where X X X has 
the product metric. 


29. Let z be a point in the metric space (X, p). Define the function f: X > R by f(x) = p(x, z). 
Show that f is uniformly continuous. 


30. Show that the composition of uniformly continuous mappings between metric spaces, when 
defined, is uniformly continuous. 


31. Show that a continuous mapping between metric spaces remains continuous if an equivalent 
metric is imposed on the domain and an equivalent metric is imposed on the range. 


32. For a nonempty subset E of the metric space (X, p) and a point x € X, define the distance 
from x to E, dist(x, E), as follows: 
dist(x, E) = inf {p(x, y)| ye E}. 


(i) Show that the distance function f: X > R defined by f(x) = dist(x, E), for x¢ X, is 
continuous. 


(ii) Show that {x € X| dist(x, E) =0} = E. 


33. Show that a subset E of a metric space X is open if and only if there is a continuous real-valued 
function f on X for which E = {xe X| f(x) >0.}. 
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34. Show that a subset E of a metric space X is closed if and only if there is a continuous 
real-valued function f on X for which E = f~!(0). 


35. Let X and Y be metric spaces. Prove that f: X—>Y is continuous if and only if f-!(C) is 
closed in X whenever C is closed in Y. 


36. Let X = Ca, b]. Define the function py: X > R by 


b 
wn= | fax for each f in X. 


Show that % is Lipschitz on the metric space X, where X has the metric induced by the 
maximum norm. 


9.4 COMPLETE METRIC SPACES 


By itself, the structure of a metric space is too barren to be fruitful in the study of interesting 
problems in mathematical analysis. It is remarkable, however, that by considering metric 
spaces that possess just one additional property, namely completeness, we can establish 
an abundance of interesting and important results. We devote the next chapter to three 
fundamental theorems for complete metric spaces. 


Definition A sequence {x,} in a metric space (X, p) is said to be a Cauchy sequence provided 
for each € > 0, there is an index N for which 


ifn,m > N, then p(xXn,Xm) <€. 


This generalizes the concept of Cauchy sequence we first considered, in Chapter 1, for 
sequences of real numbers and then, in Chapter 7, for sequences in a normed linear space. 
For general metric spaces, as in the case of a normed linear space, a convergent sequence is 
Cauchy and a Cauchy sequence is bounded (see Problem 37). 


Definition A metric space X is said to be complete provided every Cauchy sequence in X 
converges to a point in X. 


The completeness axiom for the real numbers is equivalent to the completeness of the 
metric space R. From this we infer that each Euclidean space R” is complete. Moreover, the 
Riesz-Fischer Theorem, proved in Section 7.3, tells us that for E a Lebesgue measurable set 
of real numbers and 1 < p < oo, L?(E) is complete. 


Proposition 10 Let [a, b] be a closed, bounded interval of real numbers. Then C [a, b], with 
the metric induced by the maximum norm, is complete. 


Proof Let {f,} be a Cauchy sequence in C[a, b]. First suppose there is a convergent series 
Dh 4 such that 


ll fet. — Sillmax < ay for all k. (2) 


Since 
n+k—1 


fn+k — fn = > [fj+1 — fj] for all n, k, 


j=n 
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n+k—1 00 
I fatk — fallmax< >) Wfie1 — fillmax < >) a; for all n, k. 
j=n j=n 
Let x belong to [a, b]. Then 
oO 
| fn+k(x) — fa(x)| < > a; for all n, k. (3) 


j=n 
The series ¥7° , a; converges, and therefore { f,,(x)} is a Cauchy sequence of real numbers. 
The real numbers are complete. Denote the limit of {f,(x)} by f(x). Take the limit as 
k — ov in (3) to conclude that 


If(x) — fn(x)| < S a; for all n and all x € [a, b]. 


j=n 


We infer from this estimate that {f,} converges uniformly on [a, b] to f. Since each f,, is 
continuous, so is f. The general case follows from the particular case by noting that a Cauchy 
sequence converges if is has a convergent sequence and every Cauchy sequence in C[a, b] 
has a subsequence for which (2) holds. 


In general, a subspace of a complete metric space is not complete. For instance, an 
open, bounded interval of real numbers is not complete, while R is complete. However, 
there is the following simple characterization of those subspaces that are complete. 


Proposition 11 Let E be a subset of the complete metric space X. Then the metric subspace 
E is complete if and only if E is a closed subset of X. 


Proof First suppose E is a closed subset of X. Let {x,} be a Cauchy sequence in E. Then 
{xn} can be considered as a Cauchy sequence in X and X is complete. Thus {x,} converges 
to a point x in X. According to Proposition 6, since E is a closed subset of X, the limit of 
a convergent sequence in E belongs to E. Thus x belongs to E and hence E is a complete 
metric space. 


To prove the converse, suppose E is complete. According to Proposition 6, to show 
E is a closed subset of X we must show that the limit of a convergent sequence in E also 
belongs to E. Let {x,} be a sequence in E that converges to x € X. But a convergent sequence 
is Cauchy. Thus, by the completeness of E, {x,} converges to a point in E. But a convergent 
Sequence in a metric space has only one limit. Thus x belongs to E. LI 


Theorem 12 The following are complete metric spaces: 


(i) Each nonempty closed subset of Euclidean space R". 
(ii) For E a measurable set of real numbers and 1 < p < 00, each nonempty closed subset 
of L?(E). 
(iii) Each nonempty closed subset of C[a, b]. 
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Definition For a nonempty subset E of a metric space (X, p), we define the diameter of E, 
diam E, by 
diam E = sup {p(x, y)| x,yeE}. 


We say E is bounded provided it has finite diameter. A descending sequence {E,}°°, of 
nonempty subsets of X is called a contracting sequence provided 


lim diam(E,,) = 0. 
n> &© 


The Nested Set Theorem of Chapter 1 tells us that the intersection of a contracting 
sequence of nonempty closed sets of real numbers consists of a single point. This generalizes 
as follows. 


The Cantor Intersection Theorem Let X be a metric space. Then X is complete if and only if 
whenever {F,}"°_, is a contracting sequence of nonempty closed subsets of X, there is a point 
x€X for which (\?-_, Fr = {x}. 


Proof First assume X is complete. Let {F,,}°°_, be a contracting sequence of nonempty closed 
subsets of X. For each index n, select x, € F,. We claim that {x,} is a Cauchy sequence. 
Indeed, let « > 0. There is an index N for which diam Fy < e. Since {F,}°°, is descending, 
if n,m > N, then x, and x,, belong to Fy and therefore p(x, xm) < diam Fy < ¢«. Thus 
{xn} is a Cauchy sequence. Since X is complete, this sequence converges to some x € X. 
However, for each index n, F, is closed and x; € F,, for k > n so that x belongs to F,. Thus x 
belongs to (2, F,. It is not possible for the intersection to contain two points for, if it did, 
lim, —-, o9 diam F,, #0. 


To prove the converse, suppose that for any contracting sequence {F,}°° , of nonempty 
closed subsets of X, there is a point x € X for which (\°°, F, = {x}. Let {x,} be a Cauchy 
sequence in X. For each index n define F,, to be the closure of the nonempty set {x; | k > n}. 
Then {F,} is a descending sequence of nonempty closed sets. Since {x,} is Cauchy, the 
sequence {F,} is contracting. Thus, by assumption, there is a point x in X for which 
{x} = Po, Fn. For each index n, x is a point of closure of {x,|k > n} and therefore any 
ball centered at x has nonempty intersection with {x,|k > n}. Hence we may inductively 
select a strictly increasing sequence of natural numbers {n;} such that for each index k, 
p(X, Xn, ) <1/k. The subsequence {x,,} converges to x. Since {x,} is Cauchy, the whole 
sequence {x,} converges to x (see Problem 38). Therefore X is complete. U 


A very rough geometric interpretation of the Cantor Intersection Theorem is that a 
metric space fails to be complete because it has “holes.” If X is an incomplete metric space, 
it can always be suitably minimally enlarged to become complete. For example, the set of 
rational numbers is not complete, but it is a dense metric subspace of the complete space R. 
As a further example, let X = C[a, b], now considered with the norm || - ||1, which it inherits 
from L'[a, b]. The metric space (X, p;) is not complete. But it is a dense metric subspace 
of the complete metric space L'[a, b]. These are two specific examples of a construction 
that has a quite abstract generalization. We outline a proof of the following theorem in 
Problem 49. 
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Theorem 13 Let (X, p) bea a metric space. Then there is a complete metric space (X , p) for 
which X is a dense subset of X and 


p(u, v) = p(u, v) forallu, vex. 


We call the metric space described above the completion of (X, p). In the context of 


metric spaces the completion is unique in the sense that any two completions are isometric 
by way of an isometry that is the identity mapping on X. 


37. 


38. 


39. 


42. 
43. 


45. 


PROBLEMS 
In a metric space X, show (i) that a convergent sequence is Cauchy and (ii) that a Cauchy 
sequence is bounded. 


In a metric space X, show that a Cauchy Sequence converges if and only if it has a convergent 
subsequence. | 


Suppose that {x,} is a sequence in a complete metric space (X, p) and for each index 
nN, P(Xn, Xn41) < 1/2". Show that {x,} converges. Does {x,} converge if for each index n, 
P( Xn, Xn41) <1/n? 


. Provide an example of a descending countable collection of closed, nonempty sets of real 
numbers whose intersection is empty. Does this contradict the Cantor Intersection Theorem? 


. Let p and o be equivalent metrics on a nonempty set X. Show that (X, p) is complete if and 


only if (X, o) is complete. 
Prove that the product of two complete metric spaces is complete. 


For a mapping f of the metric space (X, p) to the metric space (Y, 0), show that f is 
uniformly continuous if and only if for any two sequences {u,} and {v,} in X, 


if lim (un, Un) = 0, then tim o( f(un), f(vn))) = 0. 


. Use the outline below to prove the following extension property for uniformly continuous 


mappings: Let X and Y be metric spaces, with Y complete, and f a uniformly continuous 
mapping from a subset E of X to Y. Then f has a unique uniformly continuous extension to 
a mapping f of E to Y. 


(i) Show that f maps Cauchy sequences in E to Cauchy sequences in Y. 


(ii) For xe E, choose a sequence {x,} in E that converges to x and define f(x) to be the 
limit of { f(x, )}. Use Problem 43 to show that f(x) is properly defined. 


(iii) Show that f is uniformly continuous on E. 


(iv) Show that the above extension is unique since any two such extensions are continuous 
mappings on E that take the same values on the dense subset E of E. 


Consider the countable collection of metric spaces {(Xn, pn )}°°.,. For the Cartesian product 
of these sets Z=[] | Xn, define o on ZX Z by setting, for x = Gin). y={yn}, 


[o.¢) 
(x,y) = >) 2°" pf (xn, Yn) Where each p* = p,/(1+ pn). 
n=1 


(i) Show that o is a metric. 
(ii) Show that (Z, o) is complete if and only if each (Xp, Pn )i is complete. 


48. 


49. 


50. 
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. For each index n, define f,(x) = ax" + Bcos(x/n) for 0 < x < 1. For what values of the 
parameters a and B is the sequence { f,,} a Cauchy sequence in the metric space C[0, 1]? 


. Let D be the subspace of C[0, 1] consisting of the continuous functions f: [0, 1} - — R that are 


differentiable on (0, 1). Is D complete? 


Define £ to be the subspace of C[0, 1] consisting of the functions f: [0, 1] R that are 

Lipschitz. Is £ complete? 

For a metric space (X, p), complete the following outline of a proof of Theorem 13: 

(i) If {x,} and {y,} are Cauchy sequences in X, show that {0(%n, yn )} isa Cauchy sequence 
of real numbers and therefore converges. 


(ii) Define X’ to be the set of Cauchy sequences in X. For two Cauchy sequences in X, (%n} 
and {y,}, define p ‘(Un {Yn} ) = lim p(xn, yn ). Show that this defines a pseudometric p’ 
on X’. 


(iii) Define two members of X’ , that is, two Cauchy sequences {x,} and {y,} in X, to be 
equivalent, provided p’( {xn}, {y,}) = 0. Show that this is an equivalence relation in X’ 


~and denote by X the set of equivalence classes. Define the distance p between two 
equivalence classes to be the p’ distance between representatives of the classes. Show 


that p is properly defined and is a metric on X. 
(iv) Show that the metric space (X ,p) is complete. (Hint: If {x,} is a Cauchy sequence 


from X, we may assume [by taking subsequences] that p(x, %n41) < 27" for all n. 
If {(%n,m} ro tbm= ,; 18 a sequence of such Cauchy sequences that represents a Cauchy 


sequence in X, then the sequence {Xn} =I is a Cauchy sequence from X that represents 
the limit of the Cauchy sequences from X .) 


(v) Define the mapping h from X to X by defining, for x € X, h(x) to be the equivalence 
Class of the constant sequence all of whose terms are x. Show that h(X) is dense in X 
and that p(h(u), h(v)) = p(u, v) for all u, ve X. 

(vi) Define the set ¥ to be the disjoint union of X and X~h(X). Foru,ve X, define p p(u, v) 
as follows: p(u, v) = p(u, v) if u, ve Xx; p(u, v) = p(u, v) for u, veX~ h(X); and 
p(u, v) = p(h(u), v) forueX, ve X~ ~h(X). From the preceding two parts conclude 
that the metric space (X , p) is acomplete metric space containing (X, p) as a dense 
subspace. 


Show that any two completions of a metric space X are isometric by way of an isometry that 
is the identity mapping on X. 


9.5 COMPACT METRIC SPACES 


Recall that a collection of sets {E,}xca 18 Said to be a cover of a set E provided E CU), Ey. 
By a subcover of a cover of E we mean a subcollection of the cover which itself also is a 


cover of E. If E is a subset of a metric space X, by an open cover of E we mean a cover of 


E consisting of open subsets of X. The concept of compactness, examined in Chapter 1 for 
sets of real numbers, generalizes as follows to the class of metric spaces. 


Definition A metric space X is called compact provided every open cover of X has a finite 
subcover. A subset K of X is called compact provided K, considered as a metric subspace of 
X, is compact. . 
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An open subset of the subspace K of a metric space X is the intersection of K with an 
open subset of X. Therefore a subset K of a metric space X is compact if and only if each 
cover of K by a collection of open subsets of X has a finite subcover. 

If T is a collection of open subsets of a metric space X, then the collection F of 
complements of sets in T is a collection of closed sets. Moreover, T is a cover if and only if F 
has empty intersection. Thus, by De Morgan’s Identities, a metric space X is compact if and 
only if every collection of closed sets with a nonempty intersection has a finite subcollection 
whose intersection also is nonempty. A collection F of sets in X is said to have the finite 
intersection property provided any finite subcollection of F has a nonempty intersection. 
Thus we may formulate compactness in terms of collections of closed sets as follows. 


Proposition 14 A metric space X is compact if and only if every collection F of closed subsets 
of X with the finite intersection property has nonempty intersection. 


Definition A metric space X is said to be totally bounded provided for each € > 0, the space 
X can be covered by a finite number of open balls of radius ¢. A subset E of X is called totally 
bounded provided that E, considered as a subspace of the metric space X, is totally bounded. 


For a subset E of a metric space X, by an €-net for E we mean a finite collection of open 
balls {B(x;,, €)}?_, with centers x; in X whose union covers E. We leave it as an exercise to 
show that the metric subspace E is totally bounded if and only if for each € > 0, there is a 
finite «-net for E. The point of this observation is that regarding the criterion for a metric 
subspace E to be totally bounded it is not necessary to require that the centers of the balls 
in the net belong to E. 

If a metric space X is totally bounded, then it is bounded in the sense that its diameter 
is finite. Indeed, if X is covered by a finite number of balls of radius 1, then we infer from 
the triangle inequality that diam X < c, where c = 2+d, d being the maximum: distance 
between the centers of the covering balls. However, as is seen in the following example, a 
bounded metric space need not be totally bounded. 


Example Let X be the Banach space ¢* of square summable sequences. Consider the 
closed unit ball B = {{x,} € £7 | I{xn}{l2 < 1}. Then B is bounded. We claim that B is not 
totally bounded. Indeed, for each natural number n, let e, have nth component 1 and other 
components 0. Then |{@n — em ||2 = /2if m#n. Then B cannot be contained in a finite number 
of balls of radius r < 1/2 since one of these balls would contain two of the €n’S, Which are 
distance /2 apart and yet the ball has diameter less than 1. 


Proposition 15 A subset of Euclidean space R" is bounded if and only if it is totally bounded. 


Proof It is always the case that a totally bounded metric space is bounded. So let E be a 
bounded subset of R”. For simplicity take n = 2. Let € > 0. Since E is bounded, we may take 
a > 0 large enough so that E is contained in the square [—a, a]x[—a, a]. Let P; be a partition 
of [~a, a] for which each partition interval has length less than 1/k. Then P; X P; induces a 
partition of [—a, a] x[—a, a] into closed rectangles of diameter at most /2/k. Choose k such 
that /2/k < €. Consider the finite collection of balls of radius € with centers (x, y) where 
x and y are partition points of P;. Then this finite collection of balls of radius € covers the 
square [—a, a] X[—a, a] and therefore also covers E. a 
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Definition A metric space X is said to be sequentially compact provided every sequence in 
X has a subsequence that converges to a point in X. 


Theorem 16 (Characterization of Compactness for a Metric Space) For a metric space X, 
the following three assertions are equivalent: 


(i) X is complete and totally bounded; 
(ii) X is compact; 
(tii) X is sequentially compact. 


For clarity we divide the proof into three propositions. 
Proposition 17 [fa metric space X is complete and totally bounded, then it is compact. 


Proof We argue by contradiction. Suppose {Oy},<, is an open cover of X for which there 
is no finite subcover. Since X is totally bounded, we may choose a finite collection of open 
balls of radius less than 1/2 that cover X. There must be one of these balls that cannot be 
covered by a finite subcollection of {O,},<,. Select such a ball and label its closure F;. Then 
F, is closed and diam F, < 1. Once more using the total boundedness of X, there is a finite 
collection of open balls of radius less than 1/4 that cover X. This collection also covers F;. 
There must be one of these balls whose intersection with F, cannot be covered by a finite 
subcollection of {O,},-,4. Define F> to be the closure of the intersection of such a ball with 
F,. Then F; and F) are closed, Fy C Fi, and diam F; < 1, diam F) < 1/2. Continuing in 
this way we obtain a contracting sequence of nonempty, closed sets {F,} with the property 
that each F,, cannot be covered by a finite subcollection of {O,},c¢,. But X is complete. 
According to the Cantor Intersection Theorem there is a point xo in X that belongs to the 
intersection (\° , F,. There is some Ao such that QO), contains xq and since O), is open, there 
is a ball centered at xo, B(xo, r), such that B(xo, r) C Oa,. Since lim, -, 4. diam F, = 0 and 
x0 € (\n—1 Fn, there is an index n such that F,, C O,,. This contradicts the choice of F, as 
being a set that cannot be covered by a finite subcollection of {O)},<,. This contradiction 
shows that X is compact. CI 


Proposition 18 [fa metric space X is compact, then it is sequentially compact. 


Proof Let {x,} be a sequence in X. For each index n, let F,, be the closure of the nonempty 
set {xz |k = n}. Then {F,,} is a descending sequence of nonempty closed sets. According to 
the Cantor Intersection Theorem there is a point x9 in X that belongs to the intersection 
Mpo1 Fn. Since for each n, xo belongs to the closure of {x;|k > n}, the ball B(xo, 1/k) has 
nonempty intersection with {x;,|k > n}. By induction we may select a strictly increasing 
sequence of indices {n;} such that for each index k, p(x0, Xn, ) < 1/k. The subsequence {x,, } 
converges to x9. Thus X is sequentially compact. 


Proposition 19 Jf a metric space X is sequentially compact, then it is complete and totally 
bounded. 


Proof We argue by contradiction to establish total boundedness. Suppose X is not totally 
bounded. Then for some ¢ > 0 we cannot cover X by a finite number of open balls of radius «. 
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Select a point x; in X. Since X is not contained in B(x;, €), we may choose x2 € X for which 
p(x1, x2) > €. Now since X is not contained in B(x;, €) U B(x2, €), we may choose x3 € X 
for which p(x3, x2) > € and p(x3, x; ) > ¢. Continuing in this way we obtain a sequence {x,} 
in X with the property that p(x, x,) > « forn >k. Then the sequence {x,} can have no 
convergent subsequence, since any two different terms of any subsequence are a distance « 
or more apart. Thus X is not sequentially compact. This contradiction shows that X must be 
totally bounded. 


To show that X is complete, let {x,} be a Cauchy sequence in X. Since X is sequentially 
compact, a subsequence of {x,} converges to a point x € X. Using the Cauchy property it is 
not difficult to see that the whole sequence converges to x. Thus X is complete. LJ 


These three propositions complete the proof of the Characterization of Compactness 
Theorem. 

Since Euclidean space R” is complete, each closed subset is complete as a metric 
subspace. Moreover, Proposition 15 asserts that a subset of Euclidean space is bounded 
if and only if it is totally bounded. Therefore from our Characterization of Compactness 
Theorem we have the following characterization of compactness for a subspace of Euclidean 
space. 


Theorem 20 For a subset K of R", the following three assertions are equivalent: 


(i) K is closed and bounded; 
(ii) K is compact; 
(iii) K is sequentially compact. 


Regarding this theorem, the equivalence of (i) and (ii) is known as the Heine-Borel 
Theorem and that of (i) and (iii) the Bolzano-Weierstrass Theorem. In Chapter 1, we proved 
each of these in R = R!, because we needed both of them for the development of the 
Lebesgue integral for functions of a real variable. 


Proposition 21 Let f be a continuous mapping from a compact metric space X to a metric 
space Y. Then its image f(X) also is compact. 


Proof Let {O)},¢, be an open covering of f(X). Then, by the continuity of f, {f~!(O,)}aea 


covers f(X). 


One of the first properties of functions of a real variable that is established in a calculus 
course and which we proved in Chapter 1 is that a continuous function on a closed, bounded 
interval takes maximum and minimum values. It is natural to attempt to classify the metric 
spaces for which this extreme value property holds. 


Theorem 22 (Extreme Value Theorem) Let X be a metric space. Then X is compact if and 
only if every continuous real-valued function on X takes a maximum and a minimum value. 
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Proof First assume X is compact. Let the function f: X > R be continuous. The preceding 
proposition tells us that f(X) is a compact set of real numbers. According to Corollary 
20, f(X) is closed and bounded. We infer from the completeness of R that a closed and 
bounded nonempty set of real numbers has a largest and smallest member. 


To prove the converse, assume every continuous real-valued function on X takes 
a maximum and minimum value. According to Theorem 20, to show that X is compact 
it is necessary and sufficient to show it is totally bounded and complete. We argue by 
contradiction to show that X is totally bounded. If X is not totally bounded, then there is 
an r > 0 and a countably infinite subset of X, which we enumerate as {x, ye,» for which 
the collection of open balls {B(x,, r)}°°, is disjoint. For each natural number n, define the 
function f,: X > R by 


meaday go Xn) if p(x, Xn) <r/2 


0 otherwise. 


Define the function f: X > R by 
oO 
f(x) = Sin- f(x) for all xe X. 
n=1 


Since each f, is continuous and vanishes outside B(x, r/2) and the collection {B(xn, r)}%, 

is disjoint, f is properly defined and continuous. But for each natural number n, f(x,) = 
n-r/2, and hence /f is unbounded above and therefore does not take a maximum value. This 
is a contradiction. Therefore X is totally bounded. It remains to show that X is complete. Let 
{xn} be a Cauchy sequence in X. Then for each x € X, we infer from the triangle inequality 
that {p(x, x, )} is a Cauchy sequence of real numbers that, since R is complete, converges to 
a real number. Define the function f: X > R by 


f(x) = lim p(x, xn) for allxe X. 


Again by use of the triangle inequality we conclude that f is continuous. By assumption, 
there is a point x in X at which f takes a minimum value. Since {x,} is Cauchy, the infimum 
of f on X is 0. Therefore f(x) = 0 and hence {x,} converges to x. Thus Xiscomplete. 


If {Oa}aca iS an open cover of a metric space X, then each point x € X is contained in a 
member of the cover, O), and since O) is open, there is some € > 0, such that 


B(x,€) CO). (4) 


In general, the « depends on the choice of x. The following proposition tells us that for 
a compact metric space this containment holds uniformly in the sense that we can find « 
independently of x¢X for which the inclusion (4) holds. A positive number € with this 
property is called a Lebesgue number for the cover {O)}yc,. 


The Lebesgue Covering Lemma Let {O)},-, be an open cover of a compact metric space X. 
Then there is a number € > 0, such that for each x € X, the open ball B(x, €) is contained in 
some member of the cover. 
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Proof We argue by contradiction. Assume there is no such positive Lebesgue number. Then 
for each natural number n, 1/n fails to be a Lebesgue number. Thus there is a point in X, 
which we label x,, for which B(x,, 1/n) fails to be contained in a single member of the 
cover. This defines a sequence {x,} in X. By the Characterization of Compactness Theorem, 
X is sequentially compact. Thus a subsequence {x,, } converges to a point x9 € X. Now there 
is some Ag € A for which O,, contains x and since O,, is open, there is a ball centered at xo, 
B(xo, ro), for which 
B(x, 70) C Ory. 
We may choose an index k for which p(x0, Xn, ) <1ro/2 and 1/n, < ro/2. By the triangle 


inequality, B(xn,, 1/nx) C O,, and this contradicts the choice of x,, as being a point for 
which B(xn,, 1/n;) fails to be contained in a single member of the cover. 


Proposition 23 A continuous mapping from a compact metric space (X, p) into a metric 
space (Y, a) is uniformly continuous. 


Proof Let f be a continuous mapping from X to Y. Let « > 0. By the e€-6 criterion for 
continuity at a point, for each xe X, there is a 5, > 0 for which if p(x, x’) < 6,, then 
o( f(x), f(x’)) <«/2. Therefore, setting O, = B(x, 5,), by the triangle inequality for o, 


o(f(u), F(v)) so(f(u), f(x)) +o( f(x), f(v)) <cifu, vEO,. (5) 


Let 6 be a Lebesgue number for the open cover {O,},<x. Then for u, ve X, if p(u, v) <6 
there is some x for which u € B(v, 5) C O, and therefore, by (5), a( f(u), f(v)) <e. C 


PROBLEMS 


51. Consider the metric space Q consisting of the rational numbers with the metric induced by 
the absolute value. Which subspaces of Q are complete and which are compact? 


52. Let B = B(x, r) be an open ball in Euclidean space R". Show that B fails to be compact by 
(i) showing B is not sequentially compact, (ii) finding an open cover of B without any finite 
subcover, and (ii) showing B is not closed. 


53. When is a nonempty set X with the discrete metric a compact metric space? 


54. Let p and a be equivalent metrics on a nonempty set X. Show that the metric space (X, p) is 
compact if and only if the metric space (X, a) is compact. 


55. Show that the Cartesian product of two compact metric spaces also is compact. 
56. Show that the Cartesian product of two totally bounded metric spaces also is totally bounded. 


57. For E contained in a metric space X, show that the subspace E is totally bounded if and only 
if for each € > 0, E can be covered by a finite number of open balls (open in X) of radius € 
which have centers belonging to X. 


58. Let E be a subset of the compact metric space X. Show the metric subspace E is compact if 
and only if E is a closed subset of X. 


59. (Fréchet Intersection Theorem). Let {F,}°°, be a descending countable collection of 
nonempty closed subsets of a compact metric space X. Show that VP, F, #9. 


60. For a subset E of a metric space X, show that E is totally bounded if and only if its closure E 
is totally bounded. 


61 


62. 


63. 


64. 


65. 


66. 
67. 


68. 


69. 


70. 


71. 


72. 


73. 
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. For a subset E of a complete metric space X, show that E is totally bounded if and only if its 


closure E is compact. 


Let B = {{x,} € @7 | py x2 <1} be the closed unit ball in £2. Show that B fails to be compact 


by (i) showing B is not sequentially compact, (ii) finding an open cover of B without any finite 
subcover, and (ii) showing B is not totally bounded. 


Let B = {f € L’[a, b] ||| f\l2 < 1} be the closed unit ball in L[a, b]. Show that B fails to be 

compact by (1) showing B is not sequentially compact, (ii) finding an open cover of B without 

any finite subcover, and (ii) showing B is not totally bounded. 

Let X be a totally bounded metric space. | 

(i) If f is a uniformly continuous mapping from X to a metric space Y, show that f(X) is 
totally bounded. 

(ui) Is (4) still true if f is only required to be continuous? 


Let p be a metric on a set X. Define 


p(u, v) 


PA! for all X. 
T+ plu, v) or all u, ve 


T(u, v) = 


Verify that 7 is a bounded metric on X and convergence of sequences with respect to the 
p metric and the 7 metric is the same. Conclude that sets that are closed with respect to 
the p metric are closed with respect to the 7 metric and that sets that are open with respect to 
the p metric are open with respect to the r metric. Are the metrics p and r equivalent? 


Let E be a subset of Euclidean space R”. Assume every continuous real-valued function of 
E takes a minimum value. Prove that E is closed and bounded. 


Let E be a subset of Euclidean space R”. Assume every continuous real-valued function of 
E is uniformly continuous. Prove that E is closed and bounded. 


Suppose f is a continuous real-valued function on Euclidean space R” with the property that 


there is anumber c such that | f(x)| > c-||x|| for all x €R". Show that if K is a compact set of 


real numbers, then its inverse image under f, f— (K ), also is compact. (Mappings with this 
property are called proper.) 7 


For a compact metric space (X, p), show that there are points u, ve X for which p(u, v) = 
diam X. 


Let K be a compact subset of the metric space (X, p) and xq belong to X. Show that there is 
a point z € K for which 
p(z, x0) < p(x, x0) for all x K. 


Let K bea compact subset of the metric space X. For a point x € X ~ K, show that there is a 
open set Y/ containing K and an open set O containing x for which UN O= Q. 


Let A and B be subsets of a metric space (X, p). Define 
dist(A, B) = inf {p(u, v)| we A,veB}. 


If A is compact and B is closed, show that A N B = @ if and only if dist(A, B) > 0. 


Let K be a compact subset of a metric space X and O an open set containing K. Use the 
preceding problem to show that there i is an open set U for which K CUCUCO. 
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9.6 SEPARABLE METRIC SPACES 


Definition A subset D of a metric space X is said to be dense in X provided every nonempty 
open subset of X contains a point of D. A metric space X is said to be separable provided 
there is a countable subset of X that is dense in X. 


Observe that D is dense in X if and only if every point in X is a point of closure of D, 
that is, D = X. One of the first results of mathematical analysis is that the rational numbers 
are countable and dense in R (see Theorem 2 of Chapter 1). Therefore R is separable. From 
this we infer that every Euclidean space R” is separable. The Weierstrass Approximation 
Theorem tells us that the polynomials are dense in C[a, b]. Thus the set of polynomials 
with rational coefficients is countable and dense in C[a, b]. Therefore C[a, b] is separable. 
Theorem 11 of Chapter 7 tells us that for E a Lebesgue measurable set of real numers and 
1 < p <0, the normed linear space L?(E) is separable. We showed that L™[0, 1] is not 
separable. 


Proposition 24 A compact metric space is separable. 


Proof Let X be acompact metric space. Then X is totally bounded. For each natural number 
n, cover X by a finite number of balls of radius 1/n. Let D be the collection of points that are 
centers of one of this countable collection of covers. Then D is countable and dense. LI 


Proposition 25 A metric space X is separable if and only if there is a countable collection 
{On}~°., of open subsets of X such that any open subset of X is the union of a subcollection of 
{On}rn4- 


Proof First suppose X is separable. Let D be a countable dense subset of X. If D is finite, 
then X = D. Assume D is countably infinite. Let {x,} be an enumeration of D. Then 
{B(xn, 1/m)}n,men is a countable collection of open subsets of X. We claim that every open 
subset of X is the union of a subcollection of {B(xn, 1/m)}n,m¢n- Indeed, let O be an open 
subset of X. Let x belong to O. We must show there are natural numbers n and m for which 


x € B(X, 1/m) CO. (6) 


Since O is open, there is a natural number m for which B(x, 1/m) is contained in O. Since 
x is a point of closure of D, we may choose a natural number n for which x, belongs to 
DN B(x, 1/2m). Thus (6) holds for this choice of n and m. 


To prove the converse, suppose there is a countable collection {O,}°, of open sets 
such that any open subset of X is the union of a subcollection of {O,}°°_,. For each index 
n, choose a point in O, and label it x,. Then the set {x,}°°, is countable and is dense since 
every nonempty open subset of X is the union of a subcollection of {O,}°C, and therefore 
contains points in the set {x,}°C, . C 


Proposition 26 Every subspace of a separable metric space is separable. 


Proof Let E be a subspace of the separable metric space X. By the preceding proposition, 
there is a countable collection {O,}°°., of open sets in X for which each open set in X 
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is a union of some subcollection of {O,}°° ,. Thus {O, 9 E}°°, is a countable collection 
of subsets of FE, each one of which, by Proposition 2, is open. Since each open subset of 
E is the intersection of E with an open subset of X, every open subset of E is a union 
of a subcollection of {O, 9 E}P°,. We infer from the preceding proposition that E is 
separable. C 


Theorem 27 The following are separable metric spaces: 


(i) Each nonempty subset of Euclidean space R". 


(ii) For E a Lebesgue measurable set of real numbers and 1 < p < oo, each nonempty 
subset of L?(E). 
(iii) Each nonempty subset of C[a, b}. 


PROBLEMS 
74. Let X be a metric space that contains a finite dense subset D. Show that X = D. 
75. Show that for a subset D of a metric space X, D is dense in the subspace D. 


76. Show that if two continuous mappings defined on a metric space X take the same values on a 
dense subset, then they are equal. 


71. Show that the product of two separable metric spaces is again separable. 


78. Let p and o be equivalent metrics on a nonempty set X. Show that (X, p) is separable if and 
only if (X, o) is separable. 


79. Show that on any uncountable set X there is a metric on X with respect to which X is not 
separable. 
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In this chapter we establish three theorems that are widely used in mathematical analysis. 
These theorems will be essential tools in our later study of linear operators between Banach 
spaces and Hilbert spaces and of integration on general measure spaces. 


10.1 THE ARZELA-ASCOLI THEOREM 


In many important problems in analysis, given a sequence of continuous real-valued functions 
it is useful to know that there is a subsequence that converges uniformly. In this section, 
our main result is the Arzela~Ascoli Theorem, which provides a criterion for a uniformly 
bounded sequence of continuous real-valued functions on a compact metric space X to have 
a uniformly convergent subsequence. After we prove this theorem, we relate it to the general 
problem of finding criteria for a subset of a metric space to be compact. 

For a metric space X, we denote by C(X) the linear space of continuous real-valued 
functions on X. If X is compact, according to Theorem 22 of the preceding chapter, every 
continuous function on X takes a maximum value. For a function f in C(X), define 


ll fllmax = may | f(x) 


This defines a norm, as it did in the special case X = [a, b] we first considered in Chapter 7. 
This maximum norm induces a metric by 


Pmax( 8, h) = ||g —hllmax for all g, h EC(X). 


We call this metric the uniform metric because a sequence in C(X) converges with respect 
to this metric if and only if it converges uniformly on X. A sequence that is Cauchy with 
respect to this metric is called uniformly Cauchy. The proof of the completeness of C( X ) for 
a general compact metric space X is no different than the proof for the case X = [a, b] (see 
the proof of Proposition 10 of the preceding chapter). 


Proposition 1 If X is a compact metric space, then C(X) is complete. 
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Definition A collection F of real-valued functions on a metric space X is said to be 
equicontinuous at the point x € X provided for each «€ > 0, there is a 6 > 0 such that for every 
feF and x' eX, 


if p(x’, x) <6, then |f(x') — f(x)|<e. 


The collection F is said to be equicontinuous on X provided it is equicontinuous at every point 
in X. 


Of course, each function in an equicontinuous collecton of functions is continuous 
and any finite collection of continuous functions is equicontinuous. In general, an infinite 
collection of continuous functions will not be equicontinuous. For instance, for each natural 
number n, define f,(x) = x” for 0 < x < 1. Then { f,,} is a countable collection of continuous 
functions on [0, 1] that is not equicontinuous at x = 0 and is equicontinuous at the other 
points in [0, 1]. 


Example For M > 0, let F be the collection of continuous real-valued functions on the 
closed, bounded interval [a, b] that are differentiable on the open interval (a, b) and for 
which 


| f’| < M on (a, b). 
We infer from the the Mean Value Theorem that 


|f(u) — f(v)| < M- |u— | for all u, ve [a, 5]. 


Therefore F is equicontinuous since, regarding the criterion for equicontinuity at each point 
in X, 6 = €/M responds to the € > 0, challenge. 


A sequence {f,} of real-valued functions on a set X is said to be pointwise bounded 
provided for each x € X, the sequence { f,,(x)} is bounded and is said to be uniformly bounded 
on X provided there is some M > 0 for which 


| fn| < M on X for all n. 


Lemma 2 (The Arzela—Ascoli Lemma) Let X be a separable metric space and {f,,} an 
equicontinuous sequence in C(X) that is pointwise bounded. Then a subsequence of { f,} 
converges pointwise on all of X to a real-valued function f on X. 


Proof Let {x ie be an enumeration of a dense subset D of X. The sequence of real numbers 
defined by n+» f,,(x,) is bounded. Therefore, by the Bolzano-Weierstrass Theorem, this 
sequence has a convergent subsequence, that is, there is a strictly increasing sequence of 
integers {s(1,n)} and a number a, for which 


hm fs(1n) (41) = a1. 
Using the same argument, the sequence defined by n+> f,(1,,)(x2) is bounded and therefore 


there is a subsequence {s(2, n )} of {s(1, n)} and a number ap for which limy - 00 fs(2,n)(42) = 
a2. We inductively continue this selection process to obtain a countable collection of strictly 
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increasing sequences of natural numbers {{s(j, n a and a sequence of numbers {a;} such 
that for each j, 


{s(j +1, )} is a subsequence of {s( j,n)} and lim | Fs jn) (4) = aj. 


For each index j, define f(x j) = 4;. Consider the “diagonal” subsequence { fn, } obtained 
by setting n, = s(k,k) for each index k. For each j, {nx}0& j 18 a subsequence of the j-th 
subsequence of natural numbers selected above and therefore 


lim f(x) = aj = F(x) 


Thus { f,,} converges pointwise on D to f. 


For notational convenience, assume the whole sequence of { f,} converges pointwise 
on D to f. Let xo be any point in X. We claim that { f,,(xo)} is Cauchy. Indeed, let « > 0. By 
the equicontinuity of { f,} at xo, we may choose 5 > 0 such that | f,(x) — fn(xo)| < €/3 for all 
indices n and all x € X for which p(x, x9) < 6. Since D is dense, there is a point x € D such 
that p(x, xo) <6. Moreover, since { f,(x)} converges, it must be a Cauchy sequence, and so 
we may choose N so large that 


lfn(x) -— fm(x)| < €/3 for all m,n > N. 


Then for all m,n > N, 


|fn(%0) — fm(%0)! < | fn(x0) — fa(*)I + |fn(x) — fm (x)! 
+ |fm(x0) — fin(x)| <€/3 +€/3 +/3 =e. 


Thus { fn (xo )} is a Cauchy sequence of real numbers. Since R is complete, { f,, (xo )} converges. 
Denote the limit by f(x). The sequence { f,,} converges pointwise on all of X to f: X>R. 
LJ 


We proved that a continuous real-valued function on a compact metric space is 
uniformly continuous. The exact same proof shows that if X is a compact metric space 
and F is an equicontinuous collection of real-valued functions on X, then F is uniformly 
equicontinuous in the sense that for each € > 0, there is a 6 > 0 such that for u, ve X and 
any fef, 

if p(u, v) <6, then |f(u) — f(v)| <e. 


The Arzela—AscoliTheorem Let X be a compact metric space and { f,} auniformly bounded, 
equicontinuous sequence of real-valued functions on X. Then { f,} has a subsequence that 
converges uniformly on X to a continuous function f on X. 


Proof Since X is a compact metric space, according to Proposition 24 of the preceding 
chapter, it is separable. The Arzela-Ascoli Lemma tells us that a subsequence of { f,} 
converges pointwise on all of X to a real-valued function f. For notational convenience, 
assume the whole sequence { f,,} converges pointwise on X. Therefore, in particular, for each 
x in X, {f,(x)} is a Cauchy sequence of real numbers. We use this and equicontinuity to 
show that { f,} is a Cauchy sequence in C(X). 


a a i at 
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Let e > 0. By the uniform equicontinuity of { f,,} on X, there is a § > 0 such that for all n, 
lfn(u) — fn(v)| < €/3 for all u, ve X such that p(u, v) <6. (1) 


Since X is a compact metric space, according to Theorem 16 of the preceding chapter, it is 
totally bounded. Therefore there are a finite number of points x1, ..., x, in X for which X is 
covered by {B(x;, 5)}{_,. For 1 <i <k, {fy(x;)} is Cauchy, so there is an index N such that 


lfn(xi) — fin (xi)| < €/3 for 1 <i<kandalln,m > N. (2) 
Now for any x in X, there is ani, 1 < i < k, such that p(x, x;) <6, and therefore forn, m > N, 
Ifn(*) — f(x) < |fn() — fn( xi) + lfa(ai) — fm (22)! 
+|fm(xi) — fn(x)| < 6/3 +6/3 +6/3 =e. 


Thus { f,} is uniformly Cauchy. Therefore, since C(X) is complete, { f,} converges uniformly 
on X to a continuous function. CJ 


We proved that a metric space is compact if and only if it is sequentially compact. 
Furthermore, for a subspace K of Euclidean space R", the Heine-Borel Theorem tells us 
that K is compact if and only if K is a closed, bounded subset of R”. In a general metric 
space, being closed and bounded is a necessary condition for compactness (see Problem 1), 
but it is not sufficient. For example, the closed unit ball {f € C[0, 1] 1 WF llmax < 1} of C[0, 1] 
is a closed, bounded subset of C[0, 1] which fails to be sequentially compact. Indeed, the 
sequence {f,}, defined by f,(x) = x” on [0, 1] for all n, fails to have a subsequence that 
converges uniformly to a continuous function on [0, 1]. The Arzela-Ascoli Theorem may be 
reformulated as a criterion for the determination of the closed, bounded subsets of C(X) 
that are compact. 


Theorem 3 Let X be a compact metric space and F a subset of C(X). Then F is a compact 
subspace of C(X) if and only if F is closed, uniformly bounded, and equicontinuous. 


Proof First suppose that F is closed, bounded, and equicontinuous. Let { f,,} be a sequence 
in F. According to the Arzela-Ascoli Theorem, a subsequence of { f,} converges uniformly 
to a function in f €C(X). Since F is closed, f belongs to ¥. Thus F is a sequentially 
compact metric space and therefore is compact. 


Now assume ¥ is compact. We leave it as an exercise to show that F is bounded and 
is a closed subset of C(X). We argue by contradiction to show that F is equicontinuous. 
Suppose that F is not equicontinuous at a point x in X. Then there is an €9 > 0 such that for 
each natural number n, there is a function in F that we label f,, and a point X we label x, 
for which 

|fn(%n) — fn(x)| = €o while p(x,, x) <1/n. (3) 
Since F is acompact metric space, it is sequentially compact. Therefore there is a subsequence 
{ fn, } that converges uniformly on X to a continuous function f. Choose an index K such 
that pmax(f, fn, ) < €0/3 for k > K. We infer from (3) that for k > K, 


lf (Xn, ) — £(x)| > €0/3 while p(xn,, x) <1/ng. (4) 


This contradicts the continuity of f at the point x. Therefore F is equicontinuous. C 
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Remark The proof of the Arzela—Ascoli Lemma is very similar to the proof we provided 
in Chapter 7 of Helley’s Theorem. The common technique underlying both proofs is called a 
Cantor diagonalization argument (see Problem 14). 


Remark The forthcoming Riesz’s Theorem of Chapter 13 tells us that the closed unit ball of a 
normed linear space is compact if and only if the linear space is finite dimensional. Therefore, 
given a particular infinite dimensional normed linear space, it is interesting to characterize 
the closed, bounded subsets that are compact. The compactness criterion provided by the 
Arzela-Ascoli Theorem for subsets of C(X) has a £? counterpart. It is not difficult to show 
that, for 1 < p<oo, aclosed, bounded subset of €? is compact if and only if it is equisummable 
in the sense that for each € > 0, there is an index N for which 


OO 
S xxl? <€ forall x = {xn} ES. 
k=N 


PROBLEMS 
1. Let E be acompact subspace of a metric space Y. Show that E is aclosed, bounded subset of Y. 


2. Show that an equicontinuous sequence of real-valued functions on a compact metric space is 
pointwise bounded if and only if it is uniformly bounded. 


3. Show that an equicontinuous family of continuous functions on a compact metric space is 
uniformly equicontinuous. 


4. Let X be a metric space and {f,} a sequence in C(X) that converges uniformly on X to 
f €C(X). Show that { f,,} is equicontinuous. 


5. A real-valued function f on [0, 1] is said to be Hélder continuous of order a provided there 
is a constant C for which 


lf(x) — f(y)| < Clx — y|* for all x, ye[0, 1]. 
Define the Holder norm 


I fll = max {| f(x)l + 1f(x) — f(y)I/Ix— yl? | x, ye [0, 1, x#y}. 


Show that for 0 < a < 1, the set of functions for which || f ||, < 1 has compact closure as a 
subset of C[0,1]. 


6. Let X be a compact metric space and ¥ a subset of C( X). Show that F is equicontinuous 
if and only if its closure in C(X), ¥, is equicontinuous. Conclude that a subset of C(X) has 
compact closure if and only if it is equicontinuous and uniformly bounded. 


7. Foraclosed, bounded interval [a, b], let { f,} be asequence in C[a, b]. If { f,}is equicontinuous, 
does { f,} necessarily have a uniformly convergent subsequence? If { f,,} is uniformly bounded, 
does { f,} necessarily have a uniformly convergent subsequence? 


8. Let X be a compact metric space and Y be a general metric space. Denote by C(X, Y) the set 
of continuous mappings from X to Y. State and prove a version of the Arzela-Ascoli Theorem 
for a sequence in C(X, Y) in which the assumption that { f,,} is pointwise bounded is replaced 
by the assumption that for each x € X, the closure of the set { f,,(x) | a natural number} is a 
compact subspace of Y. 


9. Let {f,} be an equicontinuous, uniformly bounded sequence of continuous real-valued 
functions on R. Show that there is a subsequence of { f,,} that converges pointwise on R toa 
continuous function on R and that the convergence is uniform on each bounded subset of R. 
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10. For 1 < p <oo, show that a subspace of 2? is compact if and only if it is closed, bounded, and 
equisummable. | 


11. For a sequence of nonnegative real numbers {c,}, let S be the subset of 2” consisting of those 
x = {x,} €@ such that |x,| < c, for all n. Show that S is equisummable if {c,} belongs to 27. 


12. For 1 < p < ov, show that the closed unit ball in the Banach space £? is not compact. 
13. For 1 < p < oo, show that the closed unit ball in the Banach space L?[0, 1] is not compact. 


14. Let S be a countable set and {f,,} a sequence of real-valued functions on S that is pointwise 
bounded on S. Show that there is a subsequence of { f,,} that converges pointwise on S to a 
real-valued function. 


10.2 THE BAIRE CATEGORY THEOREM 


Let E be a subset of a metric space X. A point x € E is called an interior point of E provided 
there is an open ball centered at x that is contained in E: the collection of interior points of E 
is called the interior of E and denoted by int E. A point x € X ~ E 1s called an exterior point 
of E provided there is an open ball centered at x that is contained in X ~ E: the collection of 
exterior points of E is called the exterior of E and denoted by ext E. If a point x € X has the 
property that every ball centered at x contains points in E and points in X ~ E, it is called a 
bounday point of E: the collection of boundary points of E is called the boundary of E and © 
denoted by bd E. We leave it as an exercise to verify that for any subset E of X: | | 


X = int E Uext E Ubd E and the union is disjoint. (5) 


Recall that a subset of A of a metric space X is said to be dense (in X) provided every 
nonempty open subset of X contains a point on A. We call a subset of a metric space hollow 
(in X) provided it has empty interior.! Observe that for a subset E of a metric space X, 


E is hollow in X if and only if its complement, X ~ E, is dense in X. (6) 


For a metric space X, a point x € X and 0<r; <ry, we have the inclusion B(x, r;) C B(x, r2). 
From the continuity of the metric we infer that B(x, r;) is closed and it contains B(x, r;). 
Thus the closure of B(x, r;) is contained in B(x, rz). Therefore, if O is an open subset of 
a metric space X, for each point x € O, there is an open ball centered at x whose closure is 
contained in O. 


The Baire Category Theorem Let X be a complete metric space. 


(i) Let{O,}°°., be a countable collection of open dense subsets of X. Then the intersection 
M2, On also is dense. 

(ii) Let {F,}°2, be a countable collection of closed hollow subsets of X. Then the union 
Ure, Fn also is hollow. 


Proof A set is dense if and only if its complement is hollow. A set is open if and only if its 
complement is closed. We therefore infer from De Morgan’s Identities that (i) and (i) are 
equivalent. We establish (i). Let x9 belong to X and rp > 0. We must show that B(xo, 70) 
contains a point of (\P°, O,. The set B(xo, ro) MO; is nonempty since O; is dense in X. 


The adjective “hollow” was suggested by Adam Ross. 
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Let x; belong to the open set B(x, r9) MO). Choose r;, 0 <r, <1, for which, if we define 
B, = B(x, r,), then 

By C B(x, 1) NO}. (7) 
Suppose n is a natural number and the descending collection of open balls {By}; _, has 
been chosen with the property that for 1 < k <n, B, has radius less than 1/k and 
By C Ox. The set B, On+1 1S nonempty since O, 4; is dense in X. Let x,41 belong to 
the open set B(Xn, rn) MW On. Choose 41,0 < 141 <1/(n +1), for which, if we define 
Brit = B(Xn41, Mn41)> Bn4t © Bn MOn41. This inductively defines a contracting sequence 
of closed sets {B,}°°, with the property that for each n, B, C O,. The metric space X 
is complete. We therefore infer from the Cantor Intersection Theorem that °°, Bn is 
nonempty. Let x, belong to this intersection. Then x, belongs to °°, O,. On the other 
hand, by (7), x. also belongs to B(xo, 79). This completes the proof of (i). L 


A subset E of a metric space X is called nowhere dense provided its closure E is hollow. 
A subset E of X is nowhere dense if and only if for each open subset O of X, ENO is not 
dense in O (see Problem 16). The Baire Category Theorem has the following equivalent 
formulation: In a complete metric space, the union of a countable collection of nowhere 
dense sets is hollow. 


Corollary 4 Let X be a complete metric space and {F,}p°., 4 countable collection of closed 
subsets of X. If 72, Fn has nonempty interior, then at least one of the Fn’s has nonempty 
interior. In particular, if X =\JP°_, Fn, then at least one of the F,’s has nonempty interior. 


Corollary 5 Let X be a complete metric space and {F, yoo, 4 countable collection of closed 
subsets of X. Then \JP-_, bd F;, is hollow. 


Proof We leave it as an exercise to show that for any closed subset E of X, the boundary of 
E, bd E, is hollow. The boundary of any subset of X is closed. Therefore, for each natural 
number n, bd F,, is closed and hollow. According to the Baire Category Theorem, °°, bd F, 
is hollow. O 


Theorem 6 Let F be a family of continuous real-valued functions on a complete metric space 
X that is pointwise bounded in the sense that for each x € X, there is a constant M, for which 
|f(x)| < M, forall f € F. 


Then there is anonempty open subset O of X on which F is uniformly bounded in the sense 
that there is a constant M for which 


[fl < M on O forall f EF. (8) 


Proof For each index n, define E, = {x € X||f(x)| <n forall f ¢F}. Then E, is closed 
since each function in ¥ is continuous and the interection of a collection of closed sets is 
closed. Since ¥ is pointwise bounded, for each x € X, there is an index n such that | f (x)| <n 


o @) 
for all f € F; that is, x belongs to E,. Hence X = J E,. Since X isa complete metric space, 


n=1 
we conclude from Corollary 4 that there is a natural number n for which E, contains an open 
ball B(x, r). Thus (8) holds for O = B(x, r) and M =n. O 
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We have seen that if a sequence of continuous real-valued functions converges 
uniformly, then the limit function is continuous and this is false for pointwise convergence. 
However, under pointwise convergence to a real-valued function of a sequence of continuous 
real-valued functions on a complete metric space, the limit function is continuous at each 
point in a dense subset of its domain. 


Theorem 7 Let X be a complete metric space and { f,} a sequence of continuous real-valued 
functions on X that converges pointwise on X to the real-valued function f. Then there is a 
dense subset D of X for which { f,,} is equicontinuous and f is continuous at each point in D. 


Proof Let m and n be natural numbers, Define 
E(m,n) = {xeX | |fj(x) — fe(x)| <1/m forall j,k >n}. 


Since each function x+>| f(x) — f;(x)| is continuous, the set E(m, n), being the intersection 
of a collection of closed sets, is closed. According to Corollary 5, 


p=x~| J bd Ena] 
n,meN 


is dense in X. Observe that if n and m are natural numbers and the point x in D belongs to 
E(m, n), then x belongs to the interior of E(m, n). We claim that { f,} is equicontinuous at 
each point of D. Indeed, let x9 belong to D. Let e > 0. Choose a natural number m for which 
1/m <e/4. Since { fn (xo )} converges to a real number, { f,( xo )} is Cauchy. Choose a natural 
number WN for which 


| fi(x0) — fe(xo))| < 1/m for all j,k > N. (9) 


Therefore xp belongs to E,»,, vy. As we observed above, xo belongs to the interior of E(m, N). 
Choose r > 0 such that B(x, r) C E(m, N), that is, 


|fi(*%) — fe(x))| < 1/m for all j,k > N and all x € B(x, r). (10) 
The function fy is continuous at x9. Therefore there is a 6,0 <6 <r, for which 
| f(x) — fu(xo)| <1/m for all x € B(x, 8). (11) 
Observe that for every point x € X and natural number j, 
fix) — Fy(x0) = Li) — fv (2) + Lf (x) — fiv(x0)] + Lf(20) — fi) 
We infer from (9), (10), (11), and the triangle inequality that 
|fi(x) — f7(x0)| < 3/m < [3/4]e for all j > N and all x € B(x, 4). (12) 


The finite family of continuous functions { f Mo! is clearly equicontinuous at xo. We there- 


fore infer from (12) that {f,,} is equicontinuous at x9. This implies continuity at xo. Indeed, 
take the limit as j > 00 in (12) to obtain 


|f(x) — f(xo)| < and all x € B(x, 6). (13) 
O 


214 Chapter10 Metric Spaces: Three Fundamental Theorems 


Remark There is standard terminology associated with the ideas of this section. A subset E of 
a metric space X is said to be of the first category (or meager) if E is the union of a countable 
collection of nowhere dense subsets of X. A set that is not of the first category is said to be of 
the second category (or nonmeager), and the complement of a set of first category is called 
residual (or co-meager). The Baire Category Theorem may also be rephrased as follows: an 
open subset of a complete metric space is of the second category. 


Remark The consequences of the Baire Category Theorem are surprisingly varied. In Chapter 
13, we use Theorem 6 to prove the Open Mapping Theorem and the Uniform Boundedness 
Principle, two cornerstones for the study of linear functionals and operators. In Chapter 18, we 
use Theorem 7 to prove the Vitali-Hahn-Saks Theorem regarding the convergence of measures, 
an essential ingredient in the description of weak convergence in L'( E). In Problems 20 and 
21, two interesting properties of continuous and differentiable functions are deduced from the 
Baire Category Theorem. 


PROBLEMS 


15. Let E be a subset of a metric space X. Show that bd E is closed. Also show that if E is closed, 
then the mterior of bd E is empty. 


16. In a metric space X, show that a subset E is nowhere dense if and only if for each open subset 
O of X, ENO is not dense m O. 


17. In a complete metric space X, is the union of a countable collection of nowhere dense sets 
also nowhere dense? 


18. Let O be an open subset and F be a closed subset of a metric space X. Show that bothO~O 
and F ~ int F are closed and hollow. 


19. In a complete metric space, is the union of a countable collection of sets of the first category 
also of the first category? 


20. Let F, be the subset of C[0, 1] consistmg of functions for which there is a point xp m [0, 1] 
such that | f(x) — f(xo)| <n|x — xo| for all x €[0, 1]. Show that F, is closed. Show that F,, is 
hollow by observing that for f ¢ C[0, 1] andr >0 there a piecewise linear function g € C[0, 1] 
for which pmax( f, g) <r and the the left-hand and right-hand derivatives of g on [0, 1] are 
greater than n + 1. Conclude that C[0, 1]#U?2, F, and show that eachh €C[0, 1]~ U2, F, 
fails to be differentiable at any pomt im (0, 1). 


21. Let f be a real-valued function on a metric space X. Show that the set of pomts at which f 
is contmuous is the mtersection of a countable collection of open sets. Conclude that there is 
not a real-valued function on R that is contmuous just at the rational numbers. 


22. For each natural number n, show that m [0, 1] there is a nowhere dense closed set that has 
Lebesgue measure 1 — 1/n. Use this to construct a set of the first category m [0, 1] that has 
measure 1. 


23. A pomt x m a metric space X is called isolated provided the singleton set {x} is open m X. 
(i) Prove that a complete metric space without isolated pomts has an uncountable number 
of points. 


(ii) Use part (i) to prove that [0, 1] is uncountable. Compare this with the proof that [0, 1] 
is uncountable because it has positive Lebesgue measure. 
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(iti) Show that if X is a complete metric space without isolated points and {F, jo, isa 


countable collection of closed hollow sets, then X ~ Ur... Fn is dense anduncountable. 
24. Let E be a subset of a complete metric space X. Verify the following assertions. 
(i) If X~E is dense and F is a closed set contained in E, then F is nowhere dense. 


(ii) If E and X~E are both dense, then at most one of them is the union of a countable 
collection of closed sets. 


(iii) The set of rational numbers in (0, 1] is not the intersection of a countable collection of 
open sets. 


25. Show that under the hypotheses of Theorem 6 there is a dense open set O C X such that each 
x €O has a neighborhood U on which F is uniformly bounded. 


26. By Hélder’s Inequality, we have L?[a, b] C La, b]. Show that the set Wha, b], considered 
_ asa subset of the complete metric space L}[a, 5], is of the first category. 


27. Let f be a continuous real-valued function on R with the property that for each real number 
x, lin + oo f(nx) = 0. Show that lim, -, . f(x) =0. | 
28. Let f be a continuous real-valued function on R that has derivatives of all orders. Suppose 


that for each real number x, there is an index n = n(x) for which f(")(x) = 0. Show that f is 
a polynomial. (Hint: Apply the Baire Category Theorem twice.) 


10.3 THE BANACH CONTRACTION PRINCIPLE | 


Definition A point x in X is called a fixed point of the mapping T: X>X provided 
T(x) =x. | 


We are interested here in finding assumptions on a mapping that ensures it has a fixed 
point. Of course, a mapping may or may not have any fixed points. For instance, the mapping 
I’: R— R defined by T(x) = x + 1 certainly has no fixed points. 

A fixed point of a real-valued function of a real variable corresponds to a point in the 
plane at which the graph of the function intersects the diagonal line y = x. This observation 
provides the geometric insight for the most elementary result regarding the existence of 
fixed points: Let [a, b] be a closed, bounded interval in R and suppose that the image of 
the continuous function f: [a, b] > R is contained in [a, b]. Then f: [a, b] >R has a fixed 
point. This follows from the Intermediate Value Theorem by observing that if we define 
(x) = f(x) —x for xin [a, 5}, then g(a) > 0 and g(b) < 0, so that g(x) = 0 for some 
xo in [a, b], which means that f (xq) = Xo. | 

A subset K of R” is said to be convex provided whenever u and v belong to K, the 
segment {tu + (1—1)v|0 <¢t < 1) is contained in K. The preceding result generalizes to 
mappings on subsets of Euclidean spaces as follows: If K is a compact, convex subset of R” 
and the mapping T: K — K is continuous, then T has a fixed point. This is called Brouwer’s 
Fixed Point Theorem.” Here we will prove an elementary fixed point result called the Banach 
Contraction Principle in which there is a more restrictive assumption on the mapping but a 
very general assumption on the underlying space. 


2An analytic proof of this theorem may be found in Linear Operators, Part I (pp. 467-469) by Nelson Dunford 
and Jacob Schwartz [DS71]. 
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Definition A mapping T froma metric space ( X, p) into itself is said to be Lipschitz provided 
there is anumber c > 0, called a Lipschitz constant for the mapping, for which 
p(T(u), T(v)) <cp(u, v) forallu, ve X. 
Ifc <1, the Lipschitz mapping is called a contraction. 


The Banach Contraction Principle Let X be a complete metric space and the mapping 
T: X — X be acontraction. Then T: X — X has exactly one fixed point. 


Proof Let c be anumber with 0 < c <1 that is a Lipschitz constant for the mapping T. Select 
a point in X and label it x9. Now define the sequence {x;} inductively by defining x, = T(xo) 
and, if k is a natural number such that x; is defined, defining x,,; = T(x, ). The sequence {x,} 
is properly defined since T( X) is a subset of X. We will show that this sequence converges 
to a fixed point of T. 


Indeed, observe that by the definition of the sequence and the Lipschitz constant c, it 
follows that 


p(x2, x1) = p(T (x1), T(x0)) = p(T(T(x0)), T(x0)) < ¢e(T (x0), x0), 


and that 
P(xe41, Xk) = p(T (xk), T(xe-1)) SC p(xk, xe-1) fk > 2. 


Using an induction argument, we infer from these two inequalities that 
p(Xk41, Xk) < cép(T(xo), xo) for every natural number k. 


Hence, if m and k are natural numbers with m > k, from the triangle inequality for the metric 
p and the geometric sum formula,? it follows that 


P( Xm, Xk) < P( Xm, Xm—1) + P(Xm—1, Xm—2) + +++ + p(Xe+1, Xk) 
< [om +cm 4... + c*]p(T (x0), x0) 
_ pk m—1—k 
=e[l+c+---+¢ Je(T (x0), x0) 


1—cm-* 


1-—c 


=e. - p(T(x0), x0). 


Consequently, since 0 < c <1, 


ck 


1—c 


p(Xm, Xk) < -p(T(x0), x0) ifm > k. 


i a a 
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But lim; _.o. ck = 0, and hence, from the preceding inequality we conclude that {x;} is a 
Cauchy sequence. 


By assumption, the metric space X is complete. Thus there is a point x in X to which 
the sequence {x;,} converges. Since T is Lipschitz, it is continuous. Therefore 


T(x) = jim T (x,) = im x41 = Xx. 


Thus the mapping T: X — X has at least one fixed point. It remains to check that there is 
only one fixed point. But if u and v are points in X such that T(u) = u and T(v) = v, then 


0< plu, v) =p(T(u), T(v)) <cp(u, v), 


so that since 0 < c < 1, we must have p(u, uv) = 0, that is, wu = v. Thus there is exactly one 
fixed point. L 


The above proof of the Banach Contraction Principle actually proves substantially 
more than the existence of a unique fixed point. It provides an algorithm for approximating 
the fixed point. Indeed, under the assumptions of the Banach Contraction Principle, what 
has been proven is that if c is a number with 0 < c <1 that is a Lipschitz constant for the 
mapping T: X— X, and xo is any point in X, then (i) the sequence {x;,} defined recursively 
by setting x1 = T(xo) and x,41 = T(x;) for k > 1 converges to a fixed point x, of T and (ii) 


ck 


P( Xx, XK) < Toe’ p(T(xo), xo) for every natural number k. 


The Banach Contraction Principle is widely used in the study of nonlinear differential 
equations. We provide one example of its use. Suppose O is an open subset of the plane 
R? that contains the point (xo, yo). Given a function g: O— R, the problem we pose is to 
find an open interval of real numbers / containing the point xo and a differentiable function 
f: I—R such that 
f'(x)= g(x, f(x)) for all xe/ 


f(xo)= yo. 


A very special case of the above equation occurs if g is independent of its second 
variable, so g(x, y) = h(x). Even in this case, if the image of the function h: JR fails 
to be an interval, there is no solution of equation (14) (see Problems 42 and 43). On the 
other hand, if h is continuous, then it follows from the Fundamental Theorem of Differential 
Calculus that equation (14) has a unique solution given by 


(14) 


f(x) =»+ h(t) dt for all x € J. 


X0 


Therefore for a general continuous real-valued function of two variables g, if a continuous 
function f: J > R has the property that (x, f(x)) €O for each x €/, then f is a solution of 
(14) if and only if 


lx) =y0+ fale f(t)) dt for all x € J. (15) 


X0 
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As we will see in the proof of the next theorem, this equivalence between solutions of the 
differential equation (14) and those of the integral equation (15) is the observation that 
permits us to use fixed point theorems in the study of differential equations. 


The Picard Local Existence Theorem Let © be an open subset of the plane R* containing the 
point (xo, yo). Suppose the function g: O> R? is continuous and there is a positive number 
M for which the following Lipschitz property in the second variable holds, uniformly with 
respect to the first variable: 


le(x, v1 )—2(x, y2)| < Mly1—y2| for all points (x, y;) and (x, y2) in O. (16) 


Then there is an open interval I containing xo on which the differential equation (14) has a 
unique solution. 


Proof For £ a positive number, define I; to be the closed interval [xo — ¢, xo + £]. In view of 
the equivalence noted above between solutions of (14) and (15), it suffices to show that £ can 
be chosen so that there is exactly one continuous function f: J; — R having the property that 


f(x) = w+ fate f(t)) dt for all xe I. 


x0 


Since O is open, we may choose positive numbers a and b such that the closed rectangle 
R = [xo — a, xo +a] X[yo — b, yo +b] is contained in O. Now for each positive number £ with 
£ <a, define X, to be the subspace of the metric space C( J; ) consisting of those continuous 
functions f: I, > R that have the property that 


|f(x) — yol < b for all x € Ie; 
that is, the continuous functions on J; that have a graph contained in the rectangle 


Ie [yo —b, yo+ bj. 
For f € X¢, define the function T( f) € C(J2) by 


T( f)(x) =yot [ g(t, f(t) )dt for all x € Ip. 


x0 


A solution of the integral equation (15) is simply a fixed point of the mapping T: X~¢ > C( Iz). 
The strategy of the proof is as follows: Since C(J;) is a complete metric space and X; is a 
closed subset of C( Iz), X¢ is also a complete metric space. We will show that if £ is chosen 
sufficiently small, then 


T(X¢)C Xz and T: X, > X; is a contraction. 


Hence, we infer from the Banach Contraction Principle that T: X;—>X¢ has a unique 
fixed point. 

In order to choose £ so that T(X¢) C X¢ we first use the compactness of the closed, 
bounded rectangle R together with the continuity of g to choose a positive number K such that 


le(x, y)| < K for all points (x, y) in R. 
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Now for f € X¢ and xe kj, 


T(t )(x)— wl =| fal seoyael < ex, 


x0 


so that 
T(X¢) C X¢ provided LK < b. 


Observe that for functions f;, fo € Xe, and x € I;, we may infer from (16) that 
le(x, filx)) — g(x, fo(x))| < MPmax( fi, fr). 


Consequently, using the linearity and monotonicity properties of the integral, we have 


TAN) = TAVAL= | [fete AC) — ale, AU) 


< |x — x9|Mpmax( fi, fr) 


< £Mpmax( fi, fa). 
This inequality, together with the inclusion T(X¢) C X¢ provided £K < b, implies that 
T: X¢—> Xz; is acontraction provided £K < b and 2M <1. 


Define £ = min{b/K, 1/2M}. The Banach Contraction Principle tells us that the mapping 
T: X¢—>X, has a unique fixed point. L 


PROBLEMS 


| 29. Let p be a polynomial. Show that p: R-> Ris Lipschitz if and only if the degree of p is less 
| than 2. 


30. Fix a > 0, define f(x) = ax(1— x) for x in [0, 1]. 
(i) For what values of a is f([0, 1]) C[0, 1]? 
(ul) For what values of a is f([0, 1]) C[0, 1] and f: [0, 1] [0, 1] a contraction? 


31. Does a mapping of a metric space X into itself that is Lipschitz with Lipschitz constant less 
than 1 necessarily have a fixed point? 


32. Does a mapping of a complete metric space into itself that is Lipschitz with Lipschitz constant 
1 necessarily have a fixed point? 


| 33. Let X be a compact metric space and T a mapping from X into itself such that 
p(T(u), T(v)) < p(u, v) for all u, ve X. 


Show that T has a unique fixed point. 
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34. 


35. 


37. 


39. 


41. 


42. 


Define f(x) = 7/2 +x — arctan x for all real numbers x. Show that 
|f(u) — f(v)| < Ju -— v| for all u, veR. 


Show that f does not have a fixed point. Does this contradict the preceding problem? 


In Euclidean space R” consider the closed unit ball B = {x ER" | ||x|| < 1}. Let f map B 
into B and be Lipschitz with Lipschitz constant 1. Without using the Brouwer Fixed Point 
Theorem, show that f has a fixed point. 


. Suppose that the mapping f: R” > R” is a contraction. Define g(x) = x — f(x) for all x in 


R”. Show that the mapping g: R” > R” is both one-to-one and onto. Also show that g and 
its inverse are continuous. 


Let X be a complete metric space containing the point xo and let r be a positive real number. 
Define K = {xin X|p(x,x9) < r}. Suppose that the mapping T: K-—>X is Lipschitz with 
Lipschitz constant c. Suppose also that cr + p(T(xo), x9) <1. Prove that T(K) C K and that 
T: K—»X has a fixed point. 


. Show that if the function g: R? > R has continuous first-order partial derivatives, then for 


each point (xo, yo) in R* there is a neighborhood © of (xo, yo) on which the Lipschitz 
assumption (16) holds. 


In case the function g: O > R has the form g(x, y) = h(x)+by, where the functionh: R>R 
is continuous, prove that the following is an explicit formula for the solution of (14): 


f(x) = Hayy + [ eh) dt for all x in I. 


x0 


. Consider the differential equation 


fi(x)= 3[f (x)? for allxeR 
f(0)= 0. 
Show that the function f: R — R that is identically 0 is a solution and the function f: R>R 


defined by f(x) =0, if x <Oand f(x) = x’, if x > 0, is also a solution. Does this contradict 
the Picard Existence Theorem? 


For a positive number €, consider the differential equation 


1/e)[1 + (f(x))*] for allxeR 


f(x 


)= ( 
)= 0 


Show that on the interval J = (—e(7/2), €(a/2)) there is a unique solution of this differential 
equation that is defined by f(x) = tan(x/e) and there is no solution in an interval strictly 
containing /. 


Let J be an open interval in R and suppose that the function h: J > R has the property that 
there are points x1 < x2 in J anda number c such that h(x; ) <c < h(x) but c does not belong 
to h(I). Prove that there is no solution to the differential equation (14) by arguing that if 
f: 1 Risa solution, then the continuous function f(x) — cx fails to attain a minimum value 
on the interval [x), x2]. 
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43. Use the preceding exercise to prove the following theorem of Darboux: Let / be an open 
interval in R and suppose that the function f: J > R is differentiable. Then the image of the 
derivative f’: J -> R is an interval. 


44. State and prove a form of the Picard Existence Theorem for systems of differential equations 
in the following context: O is an open subset of R X R", g: O > R” is continuous, the point 
(x0, Yo) is in O, and the system of differential equations is : 


f’(x) = g(x, f(x)) forall xe] 
f(x0) = yo. 


(Hint: Approximate gbya Lipschitz mapping and then use the Arzela-Ascoli Theorem.) 
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We devoted the preceding two chapters to the study of metric spaces. In these spaces, we 
first used the metric to define an open ball and then used open balls to define open sets. We 
found that we were able to express a number of concepts solely in terms of the open sets 
associated with the metric. In the present chapter we study spaces for which the notion of an 
open set is fundamental: other concepts are defined in terms of open sets. Such spaces are 
called topological spaces. They are more general than metric spaces. Perhaps you ask: Why 
not stick to metric spaces? From the viewpoint of analysis the main reason is that it is often 
necessary to study such concepts as convergence of a sequence or compactness of a set in 
a setting more general than that provided by a metric space. One immediate example is to 
consider a collection of real-valued functions on a set. The concept of uniform convergence 
of a sequence of functions is a metric concept. The concept of pointwise convergence is not a 
metric concept. Another prominent example arises for a set X that is a normed linear space. 
The set X, with the metric induced by the norm, is a metric space. With respect to this metric, 
one has the concept of convergence of a sequence and compactness of a set. But on X there 
are important concepts, such as weak convergence of a sequence (we studied this in Chapter 
8) and weak compactness of a set, which cannot be formulated in the framework of a metric. 
They can be formulated as topological concepts for a topology on a normed linear space called 
the weak topology. Furthermore, the comparison of topologies illuminates our understanding 
of subtleties that arise when considering different modes of sequential convergence. 


11.1. OPEN SETS, CLOSED SETS, BASES, AND SUBBASES 


Definition Let X be anonempty set. A topology T for X is a collection of subsets of X, called 
open sets, possessing the following properties: 


(i) The entire set X and the empty-set ® are open; 


Section 11.1 Open Sets, Closed Sets, Bases, and Subbases 223 


(ti) The intersection of any finite collection of open sets is open; 
(iii) The union of any collection of open sets is open. 


A nonempty set X, together wih a topology on X, is called a topological space. For a point x 
in X, an open Set that contains x is called a neighborhood of x. 


We sometimes denote a topological space by (X, TJ ). Often we are interested in only 
one topology for a given set of points, and in such cases we sometimes use the symbol X to 
denote both the set of points and the topological space (X, J). When greater precision is 
needed, we make explicit the topology. 


Proposition 1 A subset E of a topological space X is open if and only if for each point x in X 
there is a neighborhood of x that is contained in E. 


Proof This follows immediately from the definition of neighborhood and the property of a 
topology that the union of a collection of open sets is again open. LI 


Metric Topology Consider a metric space (X, p). Define a subset O of X to be open 
provided for each point x €O there is an open ball centered at x that is contained in O. 
Thus the open sets are unions of collections of open balls. Proposition 1 of Chapter 9 is the 
assertion that this collection of open sets is a topology for X. We call it the metric topology 
induced by the metric p. As a particular case of a metric topology on a set we have the 
topology we call the Euclidean topology induced on R" by the Euclidean metric.! 


The Discrete Topology Let X be any nonempty set. Define T to be the collection of 
all subsets of X. Then T is a topology for X called the discrete topology. For the discrete 
topology, every set containing a point is a neighborhood of that point. The discrete topology 
is induced by the discrete metric. 


The Trivial Topology Let X be any nonempty set. Define T to be the collection 
of subsets of X consisting of @ and X. Then T is a topology for X called the triv- 
ial topology. For the trivial topology, the only neighborhood of a point is the whole 
set X. 


Topological Subspaces Given a topological space (X, J) and a nonempty subset E 
of X, we define the inherited topology S for E to consist of all sets of the form EM O where 
O belongs to T. We call the topological space (E, S) a subspace of (X, T ). 

In elementary analysis we define what it means for a subset of R to be open even if 
we have no need to use the word “topology.” In Chapter 1, we proved that the topological 
space R has the property that every open set is the union of a countable disjoint collection 
of open intervals. In a metric space, every open set is the union of a collection of open balls. 


1Unless otherwise stated, by the topological space R” we mean the set R” with the Euclidean topology. In the 
problems we introduce more exotic topologies on R and R2 (see Problems 9 for the Sorgenfrey Line and 10 for the 
Moore Plane). 


224 Chapter 11 Topological Spaces: General Properties 


In a general topological space it is often useful to distinguish a collection of open sets called 
a base for the topology: they are building blocks for the topology. 


Definition For a topological space (X, T ) and a point x in X, a collection of neighborhoods 
of x, B,, is called a base for the topology at x provided for any neighborhood U of x, there 
is a set B in the collection B, for which B CU. A collection of open sets B is called a base for 
the topology 7 provided it contains a base for the topology at each point. 


Observe that a subcollection of a topology is a base for the topology if and only if 
every nonempty open set is the union of a subcollection of B. Once a base for a topology is 
prescribed, the topology is completely defined: it consists of @ and unions of sets belonging 
to the base. For this reason a topology is often defined by specifying a base. The following 
proposition describes the properties that a collection of subsets of X must possess in order 
for it to be a base for a topology. 


Proposition 2 For a nonempty set X, let B be a collection of subsets of X. Then B is a base 
for a topology for X if and only if 


(i) Bcovers X, that is, X =VpepB. 
(ii) if B, and Bp are in B and x € B, Bo, then there is aset Bin B for whichx € BC By NB. 


The unique topology that has B as its base consists of 6 and unions of subcollections of B. 


Proof Assume B possesses properties (i) and (ii). Define T to be the collection of unions 
of subcollections of B together with 0. We claim that T is a topology for X. Indeed, we infer 
from (i) that the set X is the union of all the sets in B and therefore it belongs to 7. Moreover, 
it is also clear that the union of a subcollection of T is also a union of a subcollection of B 
and therefore belongs to T. It remains to show that if 0; and O» belong to 7, then their 
intersection O; NO belongs to T. Indeed, let x belong to 0; NO>. Then there are sets B, and 
By in Bsuch that x € By CO; and x € By C O». Using (ii), choose B, in B with x € By C By N Bo. 
Then 0; N02 =U, By, the union of a subcollection of B. Thus T is a topology for which 
Bis a base. It is unique. We leave the proof of the converse as an exercise. a 


A base determines a unique topology. However, in general, a topology has many bases. 
For instance, the collection of open intervals is a base for the Euclidean topology on R, 
while the collection of open, bounded intervals with rational endpoints also is a base for this 
topology. 


Example Let (X, 7) and (Y, S) be topological spaces. In the Cartesian product X x Y, 
consider the collection of sets B consisting of products 0, x 02, where ©, is open in X and 
O2 is open in Y. We leave it as an exercise to check that B is a base for a topology on X XY. 
The topology is called the product topology on X x Y. 


Definition For a topological space (X, T), a subcollection S of T that covers X is called a 
subbase for the topology T provided intersections of finite subcollections of S are a base for T. 
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Example Consider a closed, bounded interval [a, b] as a topological space with the topology 
it inherits from R. This space has a subbase consisting of intervals of the type [a, c) or (c, b] 
fora<c<b. 


Any collection of subsets S of a nonempty set X that covers X is a subbase for a unique 
topology on X since it is not difficult to see, using Proposition 2, that intersections of finite 
subcollections of S is a base. 


Definition For a subset E of a topological space X, a point x € X is called a point of closure 
of E provided every neighborhood of x contains a point in E. The collection of points of 
closure of E is called the closure of E and denoted by E. 


It is clear that we always have E C E. If E contains all of its points of closure, that is, 
E = E, the set E is said to be closed. 


Proposition 3. For E a subset of a topological space X, its closure E is closed. Moreover, E 
is the smallest closed subset of X containing E in the sense that if F is closed and E C F, then 
ECF. 


Proof The set E is closed provided it contains all its points of closure. Let x be a point 
of closure of E. Consider a neighborhood U/, of x. There is a point x’€ ENU,. Since x’ 
is a point of closure of E and U, is a neighborhood of x’, there is a point x” Ee ENU,. 
Therefore every neighborhood of X contains a point of E and hence x¢ E. So the set E 
is closed. It is clear that if A C B, then A C B, so that if F is closed and contains E, then 
ECF=F. O 


Proposition 4 A subset of a topological space X is open if and only if its complement in X is 
closed. 


Proof First suppose E is open in X. Let x be a point of closure of X ~ E. Then x cannot 
belong to E because otherwise there would be a neighborhood «x that is contained in E and 
therefore does not intersect X ~ E. Thus x belongs to X ~ E and hence X ~ E is closed. Now 
suppose X ~ E is closed. Let x belong to E. Then there must be a neighborhood of x that is 
contained in E, for otherwise every neighborhood of x would contain points in X ~ E and 
therefore x would be a point of closure of X ~ E. Since X ~ E is closed, x would belong to 
X ~ E, This is a contradiction. U 


Since X ~ [X ~ E] = E, it follows from the preceding proposition that a subset of a 
topological space X is closed if and only if its complement in X is open. Therefore, by De 
Morgan’s Identities, the collection of closed subsets of a topological space possesses the 
following properties. , 


Proposition 5 Let X be a topological space. The empty-set @ and the whole set X are closed; 
the union of any finite collection of closed subsets of X is closed; and the intersection of any 
collection of closed subsets of X is closed. 
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PROBLEMS 


1. Show that the discrete topology for a nonempty set X is a metric topology. 


2. Show that the discrete topology on a set has a unique base. 


10. 


11. 


. Regarding Proposition 2, show that if B is a base for a topology, then properties (i) and (ii) 
hold. 
. Let T; and T2 be topologies for a nonempty set X. Show that T, = T> if and only if there 


are bases B, for J; and B> for T2 that are related as follows at each point x in X: for each 
neighborhood NV, of x belonging to By, there is a neighborhood N 2 of x belonging to B> for 
which N’2 C N and for each neighborhood N> of x belonging to Bo, there is a neighborhood 
N of x belonging to B, for which NV, CN». 


. Let E be a subset of a topological space X. 


(i) A point x € X is called an interior point of E provided there is a neighborhood of x that 
is contained in E: the collection of interior points of E is called the interior of E and 
denoted by int E. Show that int E is always open and E is open if and only if E = int E. 


(i) A point x X is called an exterior point of E provided there is a neighborhood of x 
that is contained in X ~ E: the collection of exterior points of E is called the exterior 
of E and denoted by ext FE. Show that ext E is always open and E is open if and only if 
E~E CextE. 


(it) A point x € X is called a boundary point of E provided every neighborhood of x contains 
points in E and points in X ~ E: the collection of boundary points of E is called the 
boundary of E and denoted by bd E. Show that (i) bd E is always closed, (ii) E is open if 
and only if EM bd E = 9, and (iii) E is closed if and only if bd EC E. 


. Let A and B be subsets of a topological space X. Show that if A C B, then A C B. Also, show 


that (AUB) =AUBand(ANB)CANB. 


- Let O be an open subset of a topological space X. For a subset E of X, show that 0 is disjoint 


from E if and only if it is disjoint from E. 


. For a collection S of subsets of a nonempty set X, show that there is a topology 7 on X that 


contains the collection S and has the property that any other topology that contains S also 
contains 7: it is the topology with the fewest sets that contains S. 


. (The Sorgenfrey Line) Show that the collection of intervals of the form [a, b), where a < b, 


is a base for a topology for the set of real numbers R. The set of real numbers R with this 
topology is called the Sorgenfrey Line. 


(The Moore Plane) Consider the upper half plane, R*+ = {(x, y) €R?| y > 0.} For points 
(x, y) with y > 0, take as a basic open neighborhood a usual Euclidean open ball centered at 
(x, y) and contained in the upper half plane. As a basic open neighborhood of a point (x, 0) 
take the set consisting of the point itself and all the points in an open Euclidean ball in the 
upper half plane that is tangent to the real line at (x, 0). Show that this collection of sets is a 
base. The set R**+ with this topology is called the Moore Plane. 


(Kuratowski 14-subset problem) 


(i) Let E be a subset of a topological space X. Show that at most 14 different sets can be 
obtained from E by repeated use of complementation and closure. 


(ii) Give an example in R* where there are 14 different sets coming from a suitable E. 
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11.2 THE SEPARATION PROPERTIES 


In order to establish interesting results for topological spaces and continuous mappings 
between such spaces, it is necessary to enrich the rudimentary topological structure. In this 
section we consider so-called separation properties for a topology on a set X, which ensure 
that the topology discriminates between certain disjoint pairs of sets and, as a consequence, 
ensure that there is a robust collection of continuous real-valued functions on X. 

We have defined what we mean by a neighborhood of a point in a topological space. 
For a subset K of a topological space X, by a neighborhood of K we mean an open set that 
contains K. We say that two disjoint subsets A and B of X can be separated by disjoint 
neighborhoods provided there are neighborhoods of A and B, respectively, that are disjoint. 
For a topological space X, we consider the following four separation properties: 


The Tychonoff Separation Property For each two points u and v in X, there is a 
neighborhood of u that does not contain v and a neighborhood of v that does not 
contain u. 


The Hausdorff Separation Property Each two points in X can be separated by 
disjoint neighborhoods. 


The Regular Separation Property The Tychonoff separation property holds and, 
moreover, each closed set and point not in the set can be separated by disjoint 
neighborhoods. 


The Normal Separation Property The Tychonoff separation property holds and, 
moreover, each two disjoint closed sets can be separated by disjoint neighborhoods. 


We naturally call a topological space Tychonoff, Hausdorff, regular, or normal, provided it 
satisfies the respective separation property. 


Proposition 6 A topological space X is a Tychonoff space if and only if every set consisting 
of a single point is closed. 


Proof Let x be in X. The set {x} is closed if and only if X ~ {x} is open. Now X ~ {x} is open 
if and only if for each point y in X ~ {x} there is a neighborhood of y that is contained in 
X ~ {x}, that is, there is a neighborhood of y that does not contain x. L 


Proposition 7 Every metric space is normal. 


Proof Let (X, p) be a metric space. Define the distance between a subset F of X and point 
xin X by 


dist(x, F) = inf {p(x, x’)| xin F}. 


Let F, and F) be closed disjoint subsets of X. Define 


O, ={xinX | dist(x, F,) < dist(x, F))} and O2 = {xin X | dist(x, F,) < dist(x, F,)}. 
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Since the complement of a closed set is open, dist( x, F) > 0if F is closed and x does 
not belong to F. Therefore F, C O; and F, C O2 and clearly, O, NO» = 9. Moreover, using 
the triangle inequality for the metric, it is not difficult to see that O, and Oz areopen. L] 


Using obvious notation, the preceding two propositions provide the following string of 
inclusions between families of topologies on a set X: 


T metric © Tnormal © T regular © T Hausdorff © T Tychonoff- 


’ We close this brief section with the following very useful reformulation of normality in 
terms of nested neighborhoods of closed sets. 


Proposition 8 Let X be a Tychoneff topological space. Then X is normal if and only if 
whenever U is a neighborhood of a closed subset F of X, there is another neighborhood of F 
whose closure is contained in U, that is, there is an open set O for which 


FCOCOCY. 


Proof First assume X is normal. Since F and X ~ U are disjoint closed sets, there are disjoint 
open sets O and V for which F CO and X~U CY. Thus OC X~V CU. Since O C X-Vy 
and X~V is closed, OC X~Y CU. 


To prove the converse, suppose the nested neighborhood property holds. Let A and B 
be disjoint closed subset of X. Then A C X ~ Band X ~ Bis open. Thus there is an open set 
O for which AC OC OC X~B. Therefore © and X~O are disjoint neighborhoods of A 
and B, respectively. LO 


PROBLEMS 


12. Show that if F is a closed subset of a normal space X, then the subspace F is normal. Is it 
necessary to assume that F is closed? 


13. Let X be a topological space. Show that X is Hausdorff if and only if the diagonal D = 
{(x1, x2) €X X X | xy = x2) is a closed subset of X x X. 

14. Consider the set of real numbers with the topology consisting of the empty-set and sets of the 
form (—oo, c),c € R. Show that this space is Tychonoff but not Hausdorff. 


15. (Zariski Topology) In R" let B be the family of sets {x € R” | p(x) #0}, where pis a polynomial 
in n variables. Let T be the topology on X that has B as a subbase. Show that T is a topology 
for R” that is Tychonoff but not Hausdorff. 


16. Show the Sorgenfrey Line and the Moore Plane are Hausdorff (see Problems 9 and 10). 
11.3 COUNTABILITY AND SEPARABILITY 


We have defined what it means for a sequence in a metric space to converge. The following 
is the natural generalization of sequential convergence to topological spaces. 


Definition A sequence {x,} in a topological space X is said to converge to the point x € X 
provided for each neighborhood U of x, there is an index N such that if n > N, then Xp, belongs 
to U. The point x is called a limit of the sequence. 
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In a metric space, a sequence cannot converge to two different points so we refer to 
the limit of a sequence. In a general topological space, a sequence can converge to different 
points. For instance, for the trivial topology on a set, every sequence converges to every 
point. For a Hausdorff space, a sequence has a unique limit. 


Definition A topological space X is said to be first countable provided there is a countable 
base at each point. The space X is said to be second countable provided there is a countable 
base for the topology. 


_ It is clear that a second countable space is first countable. 


Example Every metric space X is first countable since for x € X, the countable collection of 
open balls {B(x, 1/n)}°° 1 18 a base at x for the topology induced by the metric. 


We leave the proof of the following proposition as an exercise. 


Proposition 9 Let X be a first countable topological space. For a subset E of X, a point x € X 
is a point of closure of E if and only if x is a limit of a sequence in E. Therefore a subset E of X 
is closed if and only if whenever a sequence in E converges to x € X, the point x belongs to E. 


In a topological space that is not first countable, it is possible for a point to be a point 
of closure of a set and yet no sequence in the set converges to the point (see Problem 22). 


Definition A subset E of topological space X is said to be dense in X provided every open set 
in X contains a point of E. We call X separable provided it has a countable dense subset. 


It is clear that a set E is dense in X if and only if every point in X is a point of closure 
of E, thatis, E = X. | 

In Chapter 9, we proved that a metric space is second countable if and only if it 
is separable. In a general topological space, a second countable space is separable, but 
a separable space, even one that is first countable, may fail to be second countable (see 
Problem 21), 

A topological space is said to be metrizable provided the topology is induced by a 
metric. Not every topology is induced by a metric. Indeed, we have seen that a metric space is 
normal, so certainly the trivial topology on a set with more than one point is not metrizable. 
It is natural to ask if it is possible to identify those topological spaces that are metrizable. By 
this we mean to state criteria in terms of the open sets of the topology that are necessary 
and sufficient in order that the topology be induced by a metric. There are such criteria.” In 
the case the topological space X is second countable, there is the following simple necessary 
and sufficient criterion for metrizability. 


The Urysohn Metrization Theorem Let X be a second countable topological space. Then X 
is metrizable if and only if it is normal. 


We already have shown that a metric space is normal. We postpone until the next » 
chapter the proof, for second countable topological spaces, of the converse. 


2The Nagata- Smirnov- —Bing Metrization Theorem is such a result; See page 127 of John Kelley’s General 
Topology [Kel55}. 
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PROBLEMS 


17. A topological space is said to be a Lindeléf space or to have the Lindel6of pro- 
perty provided each open cover of X has a countable subcover. Show that if X is sec- 
ond countable, then it is Lindel6f. 


18. Let X be an uncountable set of points, and let J consist of @ and all subsets of X that have 
finite complements. Show that T is a topology for X and that the space (X, 7) is not first 
countable. 


19. Show that a second countable space is separable and every subspace of a second countable 
space is second countable. 


20. Show that the Moore Plane is separable (see Problem 10). Show that the subspace R X {0} of 
the Moore Plane is not separable. Conclude that the Moore Plane is not metrizable and not 
second countable. 


21. Show that the Sorgenfrey Line is first countable but not second countable and yet the rationals 
are dense (see Problem 9). Conclude that the Sorgenfrey Line is not metrizable. 


22. Let X} = N XN, where N denotes the set of natural numbers and take X = X 1 U {w}, where 


w does not belong to X;. For each sequence s = {m,} of natural numbers and natural number 
n, define 
Bon = {w} U{(j, k) 2 j = my all k > n}. 


ti) Show that the sets B, , together with the singleton sets {(j, k)} form a base for a topology 
on X. 


(i!) Show that w is a point of closure of X; even though no sequence {x,} from X, converges 
to w. 


(iii) Show that the space X is separable but is not first countable and so is not second 
countable. 


(iv) Is X a Lindeléf space? 


11.4 CONTINUOUS MAPPINGS BETWEEN TOPOLOGICAL SPACES 


We defined continuity for mappings between metric spaces in terms of convergent sequences: 
A mapping f is continuous at a point x provided whenever a sequence converges to x the 
image sequence converges to f(x). We then showed that this was equivalent to the «-6 
criterion expressed in terms of open balls. The concept of continuity extends to mappings 
between topological spaces in the following natural manner. 


Definition For topological spaces (X, T) and (Y, S), a mapping f: XY is said to be 
continuous at the point xo in X provided for any neighborhood © of f(xo), there is a 
neighborhood U of xo for which f(U) CO. We say f is continuous provided it is continuous 
at each point in X. 


Proposition 10 A mapping f : X — Y between topological spaces X and Y is continuous if and 
only if for any open subset O in Y, its inverse image under f, f—'(©), is an open subset of X. 


Proof First suppose that f is continuous. Let O be open in Y. According to Proposition 1, to 
show that f~!() is open it suffices to show that each point in f—'(O) has a neighborhood 
that is contained in f~'(O). Let x belong to f—!(O). Then by the continuity of f at x 
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there is a neighborhood of x that is mapped into © and therefore is contained in f-(O). 
Conversely, if f~! maps open sets to open sets, then it is immediate that f is continuous on 
all of X. LI 


For a continuous mapping f of a topological space X to a topological space Y, by the 
definition of the subspace topology, the restriction of f to a subspace of X also is continuous. 
We leave the proof of the next proposition as an exercise. 


Proposition 11 The composition of continuous mappings between topological spaces, when 
defined, is continuous. 


Definition Given two topologies T, and T> for a set X, if T2 CT, we say that T2 is weaker 
than T, and that T, is stronger than T >. 


Given a cover S of a set X, it is useful to understand the topologies for X with respect 
to which the cover S is open. Of course, S is open with respect to the discrete topology on 
X. In fact, there is a weakest topology for X with respect to which this cover is open: it is the 
unique topology that has S as a subbase. We leave the proof of the following proposition as 
an exercise. 


Proposition 12 Let X be a nonempty set and S any collection of subsets of X that covers X. 
The collection of subsets of X consisting of intersections of finite subcollections of S is a base 
for a topology T for X. It is the weakest topology containing S in the sense that if T’ is any 
other topology for X containing S, then T CT’. 


Definition Let X be a nonempty set and consider a collection of mappings F = { fy: X> 
XalacA, Where each Xq is a topological space. The weakest topology for X that contains the 
collection of sets 


F= if a *(Oa) 
is called the weak topology for X induced by F. 


fa € F, Og open in Xa} 


Proposition 13 Let X be anonempty set and F = { fy: X > Xy}y <A 4 collection of mappings 
where each X) is a topological space. The weak topology for X induced by F is the topology 
on X that has the fewest number of sets among the topologies on X for which each mapping 
fy: X > X) is continuous. 


Proof According to Proposition 10, for each Ain A, f,: X—> X, is continuous if and only if 
the inverse image under f, of each open set in X, is open in X. LI 


Definition A continuous mapping from a topological space X to a topological space Y is 
said to be a homeomorphism provided it is one-to-one, maps X onto Y, and has a continuous 
inverse f—' from Y to X. 


It is clear that the inverse of a homeomorphism is a homeomorphism and that the 
composition of homeomorphisms, when defined, is a homeomorphism. Two topological 
spaces X and ¥ are said to be homeomorphic if there is a homeomorphism between them. 
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This is an equivalence relation among topological spaces, that is, it is reflexive, symmetric, 
and transitive. From a topological point of view two homeomorphic topological spaces are 
indistinguishable since, according to Proposition 10, for a homeomorphism f of X onto Y, a 
set E is open in X if and only if its image f( £) is open in Y. The concept of homeomorphism 
plays the same role for topological spaces that isometry plays for metric spaces and, say, 
group isomorphism plays for groups. But some care is needed here. In the next example 
we show that, for E a Lebesgue measurable set of real numbers, L'(E) is homeomorphic 
to L*(E).- 


Example (Mazur) Let E be a Lebesgue measurable set of real numbers. For f in L'(E), 
define the function ®( f) on E by ®( f)(x) = sgn(f(x))|f(x)|'/2. Then ®( f) belongs to 
L?( E). We leave it as an exercise to show that for any two numbers a and b, 


2 
sgn (a) - |al'/? — sgn (b) - |b|'/*| <2-|a-5I, 


and therefore 

I®(F) — P(g) <2- If — sll: forall f, gin L'(E). 
From this we conclude that ® is a continuous one-to-one mapping of L'( E) into L?( E). It also 
maps L'(E) onto L*(E£) and its inverse ®~! is defined by ®—!( f )(x) = sgn( f(x))| f(x)? 
for f in L?(E). Use Problem 38 to conclude that the inverse mapping ®~! is a continuous 
mapping from L?(E) to L'(E). Therefore L!(£) is homeomorphic to L*(E), where each 
of these spaces is equipped with the topology induced by its L? norm. 


PROBLEMS 


23. Let f be a mapping of the topological space X to the topological space Y and S be a subbase 
for the topology on Y. Show that f is continuous if and only if the inverse image under f of 
every set in S is open in X. 

24. Let X be a topological space. 

(i) If X has the trivial topology, find all continuous mappings of X into R. 
(ii) If X has the discrete topology, find all continuous mappings of X into R. 
(iti) Find all continuous one-to-one mappings from R to X if X has the discrete topology. 


(iv) Find all continuous one-to-one mappings from R to X if X has the trivial topology. 


25. For topological spaces X and Y, let f map X to Y. Which of the following assertions are 
equivalent to the continuity of f? Verify your answers. 


(i) The inverse image under f of every closed subset of Y is closed in X. 
(ii) If O is open in X, then f(O) is open in Y. 
(iii) If F is closed in X, then f(F’) is closed in Y. 


(iv) For each subset A of X, f(A) C f(A). 


3The same type of argument shows that any two L?(E) spaces, for 1 < p < oo, are homeomorphic. There is a 
remarkable theorem due to M.I. Kadets which tells us that any two separable infinite dimensional complete normed 
linear spaces are homeomorphic (“A Proof of the Topological Equivalence of All Separable Infinite Dimensional 
Banach Spaces,” Functional Analysis and Applications, 1, 1967). From the topological point of view, L7[0, 1] is 
indistinguishable from C[0, 1]. These spaces look very different from many other angles of vision. 


26. 
27. 
28. 


29. 


30. 


31, 
32. 
33. 
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35. 
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Prove Proposition 11. 
Prove Proposition 12. 


Prove that the sum and product of two real-valued continuous functions defined on a 
topological space X are themselves continuous. 


Let F be a collection of real-valued functions on a set X. Find necessary and sufficient 
conditions on F in order that X, considered as a topological space with the weak topology 
induced by F, is Tychonoff. 


For topological spaces X and Y, let the mapping f: X > Y be one-to-one and onto. Show 
that the following assertions are equivalent. 


(i) f isa homeomorphism of X onto Y. | 

(ii) A subset E of X is open in X if and only if f(£) is open in Y. 

(iii) A subset E of X is closed in X if and only if f(E£) is closed in Y. 

(iv) The image of the closure of a set is the closure of the image, that is, for each subset A of 
X, f(A) = f(A). 


For topological spaces X and Y, let f be a continuous mapping from X onto Y. If X is 
Hausdorff, is Y Hausdorff? If X is normal, is Y normal? 


Let p, and pz be metrics on the set X that induce topologies T; and T>2, respectively. If 
T 1 = T>2, are the metrics necessarily equivalent? 


Show that the inverse of a homeomorphism i is a homeomorphism and the composition of two 
homeomorphisms, when defined, is again a homeomorphism. 


Suppose that a topological space X has the property that every continuous real-valued 
function on X takes a minimum value. Show that any topological space that is homeomorphic 
to X also possesses this property. 


Suppose that a topological Space X has the property that every continuous real-valued function - 
on X has an interval as its image. Show that any topological space that is homeomorphic to X 
also possesses this property. 


Show that R is homeomorphic to the open bounded interval (0, 1), but is not homeomorphic 
to the closed bounded interval [0, 1]. | 


Let X and Y be topological spaces and consider a mapping f from X to Y. Suppose X = Xj UX, 
and the restrictions of f to the topological subspaces X; and to X2 are continuous. Show 
that f need not be continuous at any point in X. Show that f is continuous on X if X, and 
X2 are open. Compare this with the case of measurable functions and the inheritance of 
measurability from the measurability of restrictions. : 


Show that for any two numbers a and b, 


sgn (a) - al” — sgn (b) - |b|?| <2 |a — b|((al + [b1). 


11.5 COMPACT TOPOLOGICAL SPACES 


We have studied compactness for metric spaces. We provided several characterizations of 
compactness and established properties of continuous mappings and continuous real-valued 
functions defined on compact metric spaces. The concept of compactness can be naturally 
and usefully extended to topological spaces. 
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Recall that a collection of sets {E)},-q is said to be a cover of a set E provided 
E C Uyeca Fa. If each E, is contained in a topological space, a cover is said to be open 
provided each set in the cover is open. 


Definition A topological space X is said to be compact provided every open cover of X has 
a finite subcover. A subset K of X is called compact provided K, considered as a topological 
space with the subspace topology inherited from X, is compact. 


In view of the definition of the subspace topology, a subset K of X is compact provided 
every covering of K by a collection of open subsets of X has a finite subcover. 

Certain results regarding compactness in a topological space carry over directly from 
the metric space setting; for example, the image of a compact topological space under a 
continuous mapping also is compact. Other properties of compact metric spaces, for example, 
the equivalence of compactness and sequential compactness, carry over to the topological 
setting only for spaces that possess some additional topological structure. Other properties 
of compact metric spaces, such as total boundedness, have no simple correspondent in the 
topological setting. 

Recall that a collection of sets is said to have the finite intersection property provided 
every finite subcollection has nonempty intersection. Since a subset of a topological space X is 
closed if and only if its complement in X is open, we have, by De Morgan’s Identities, the fol- 
lowing extension to topological spaces of a result we previously established for metric spaces. 


Proposition 14 A topological space X is compact if and only if every collection of closed 
subsets of X that possesses the finite intersection property has nonempty intersection. 


Proposition 15 A closed subset K of a compact topological space X is compact. 


Proof Let {O,},<, be an open cover for K by open subsets of X. Since X ~ K is an open 
subset of X, [X ~ F] U {O)},cq is an open cover of X. By the compactness of X this cover 
has a finite subcover, and, by possibly removing the set X ~ K from this finite subcover, the 
remaining collection is a finite subcollection of {O,},<q that covers K. Thus K is compact. 


We proved that a compact subspace K of a metric space X must be a closed subset of 
X. This is also true for topological spaces that are Hausdorff. 


Proposition 16 A compact subspace K of a Hausdorff topological space X is a closed subset 
of K. 


Proof We will show that X ~ K is open so that K must be closed. Let y belong to X~ K. 
Since X is Hausdorff, for each x € K there are disjoint neighborhoods O, and U,, of x and 
y, respectively. Then {O,},¢x is an open cover of K, and so, since K is compact, there is a 
finite subcover {O;,,Ox,,...,Ox,}. Define VN = ()?_, Ux,. Then N is a neighborhood of y 
which is disjoint from each O,, and hence is contained in X ~ K. Therefore X ~ K is open. LJ 


Definition A topological space X is said to be sequentially compact provided each sequence 
in X has a subsequence that converges to a point of X. 
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We have shown that a metric space is compact if and only if it is sequentially compact. 
The same holds for topological spaces that a are second countable. | 


Proposition 17 Let X be a second countable topological space. Then X is compact if and 
only if it is sequentially compact. 


Proof First assume X is compact. Let {x,} be a sequence in X. For each index n, let F, 
be the closure of the nonempty set {xz|k > n}. Then {F,} is a descending sequence of 
nonempty closed sets. Since {F,} has the finite intersection property, by Proposition 14, 
Ara1 Fn #:, choose a point xo in this intersection. Since X is second countable, it is first 
countable. Let {B,}°° , be a base for the topology at the point x9. We may assume that each 
B41 C By. Since xo belongs to the closure of {x,|k > nj, for each n, the neighborhood B, 
has nonempty intersection with {x;|k > n}. Therefore we may inductively select a strictly 
increasing sequence of indices {nj} such that for each index k, xn, € By. Since for each 
neighborhood © of xo, there is an index N for which B, C O for n > N, the subsequence 
{Xn,} converges to xo. Thus X is sequentially compact. 


Now suppose X is sequentially compact. Since X is second countable, every open cover 
has a countable subcover. Therefore, to show that X is compact it suffices to show that every 
countable open cover of X has a finite subcover. Let {O,,}°°, be such a cover. We argue by con- 
tradiction. Assume there is no finite subcover. Then for each index n, there is anindex m(n)> 
n for which O. m(n) ~ , Oi #9. For each natural number n, choose x, € Om(n) ~ Uy Oi- 
Then, since X is sequentially compact, a subsequence of {x,} converges to Me eX. But 
{O,,}°°_, is an open cover of X, so there is some Oy that is a neighborhood of xo. Therefore, 
there are infinitely many indices n for which x, belongs to Oy. This is not possible since 
Xn EO n forn > N. L 


Theorem 18 A compact Hausdorff space is normal. 


Proof Let X be compact and Hausdorff. We first show it is regular, that is, each closed 
set and point not in the set can be separated by disjoint neighborhoods. Let F be a closed 
subset of X and x belong to X ~ F. Since X is Hausdorff, for each y € F there are disjoint 
neighborhoods O, and Uy, of x and y, respectively. Then {Uy}yeF 1S an open cover of F. 
But F is compact. "Thus here i is a finite subcover (U),, Uy,,..., Uy,}. Define MW =, Oy,. 

Then AN is a neighborhood of y which is disjoint from Ut Uy, a neighborhood of F. 

Thus X is regular. A repeat of this argument, now using regularity, shows that X is 
normal. O 


Proposition 19 A continuous one-to-one mapping f of a compact space X onto a Hausdorff 
space Y is a homeomorphism. 


Proof In order to show that f is a homeomorphism it is only necessary to show that it carries 
open sets into open sets or equivalently closed sets into closed sets. Let F be a closed subset 
of X. Then F is compact since X is compact. Therefore, by Proposition 20, f(F) is compact. 
Hence, by Proposition 16, since Y is Hausdorff, f( F’) is closed. 


Proposition 20 The continuous image of a compact topological space is compact. 
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Proof Let f be a continuous mapping of compact topological space X to a topological space 
Y. Let {Oy},cq be a covering of f(X) by open subsets of Y. Then, by the continuity of 
f, (f-'(Ox)}aea is an open cover of X. By the compactness of X, there is a finite subcol- 
lection {f~!(Oa,),..-, f-'(Oy, )} that also covers X. The finite collection {O),,...,O,,} 
covers f(X). O 


Corollary 21 A continuous real-valued function on a compact topological space takes a 
maximum and minimum functional value. 


Proof Let X be compact and f: X — R be continuous. By the preceding proposition, f(X) 
is a compact set of real numbers. Thus f(X) is closed and bounded. But a closed and 
bounded set of real numbers contains a smallest and largest member. C] 


A topological space is said to be countably compact provided every countable open 
cover has a finite subcover. We explore some properties of such spaces in Problems 39 and 40. 


PROBLEMS 
39. For a second countable space X, show that X is compact if and only if it is countably compact. 


40. (Fréchet Intersection Theorem) Let X be a topological space. Prove that X is countably 
compact if and only if whenever {F,,} is a descending sequence of nonempty closed subsets of 


X, the intersection ()°° , F,, is nonempty. 


41. Let X be compact Hausdorff and {F,,}°° , be a descending collection of closed subsets of X. 
Let © be a neighborhood of the intersection (\"°_, Fn. Show there is an index N such that 
F, CO forn > N. 


42. Show that it is not possible to express a closed, bounded interval of real numbers as the 
pairwise disjoint union of a countable collection (having more than one member) of closed, 
bounded intervals. 


43. Let f be a continuous mapping of the compact space X onto the Hausdorff space Y. Show 
that any mapping g of Y into Z for which g o f is continuous must itself be continuous. 
44. Let (X, T ) be a topological space. 
(i) Prove that if (X, 7) is compact, then (X, 7, ) is compact for any topology 7, weaker 
than T. 
(ii) Show that if (X, 7 ) is Hausdorff, then (X, Tz) is Hausdorff for any topology T 2 stronger 
than T. 


(iii) Show that if (X, T ) is compact and Hausdorff, then any strictly weaker topology is not 
Hausdorff and any strictly stronger topology is not compact. 


45. (The Compact-Open Topology) Let X and Y be Hausdorff topological spaces and Y~ the 
collection of maps from X into Y. On Y¥* we define a topology, called the compact-open 
topology, by taking as a subbase sets of the form Ux.o = {f: X>Y| f(K) CO}, where K 
is a compact subset of X and ( is an open subset of Y. Thus the compact-open topology is the 
weakest topology on Y~* such that the sets Ux «@ are open. 

(i) Let {f,} be a sequence in Y* that converges with respect to the compact-open topology 
to f ¢ Y*. Show that { f,} converges pointwise to f on X. 


Section 11.6 Connected Topological Spaces 237 


(ii) Now assume that Y is a metric space. Show that a sequence { f,} in Y* converges with 
respect to the compact-open topology to f ¢Y* if and only if {fy} converges to f 
_ uniformly on each compact subset K of X. 


46. (Dini’s Theorem) Let { f,,} be a sequence of continuous real-valued functions on a countably 
compact space X. Suppose that for each x € X, the sequence { f,,(x)} decreases monotonically 
to zero. Show that { f,,} converges to zero uniformly. 


11.6 CONNECTED TOPOLOGICAL SPACES 


Two nonempty open subsets of a topological space X are said to separate X if they are 
disjoint and their union is X. A topological space which cannot be separated by such a pair 
is said to be connected. Since the complement of an open set is closed, each of the open sets 
in a separation of a space is also closed. Thus a topological space is connected if and only if 
the only subsets that are both open and closed are the whole space and the empty-set. 

A subset E of X is said to be connected provided it is a connected topological subspace. 
Thus a subset E of X is connected if there do not exist open subsets O, and O> of X for which 


O,NE#G, O.NE#G, ECO,UO, and ENO, NO =9. 


Proposition 22 Let f be a continuous mapping of a connected space X toa topological space 
Y. Then its image f(Y) is connected. 


Proof Observe that f is a continuous mapping of X onto the topological space f(X), where 

f(X) has the subspace topology inherited from Y. We argue by contradiction. Suppose 
f (X) is not connected. Let O; and O, be a separation of f(X). Then f—!(O;) and f—!(O2) 
are disjoint nonempty open sets in X whose union is X. Thus this pair is a separation of X in 
contradiction to the connectedness of X. U 


We leave it as an exercise to show that for a set C of real numbers, the following are 
equivalent, 
(i) C is an interval; (ii) C is convex; (iii) C is connected. (1) 


Definition A topological space X is said to have the intermediate value property provided the 
image of any continuous real-valued function on X is an interval. 


Proposition 23 A topological space has the intermediate value property if and only if it is 
connected. | 


Proof According to (1), a connected set of real numbers is an interval. We therefore infer 
from Proposition 22 that a connected topological space has the intermediate value property. 
To prove the converse, we suppose that X is a topological space that is not connected 
and conclude that it fails to have the intermediate value property. Indeed, since X is not 
connected, there is a pair of nonempty open subsets of X,O, and ©», for which X = 0; UO). 

Define the function f on X to take the value 0 on O, and 1 on O>. Then f is continuous since 
f—'(A) is an open subset of X for every subset A of R and hence, in particular, for every 
open subset of R. On the other hand, f fails to have the intermediate value property. L 
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If a topological space is not connected, then for any separation of the space, a subspace 


that has nonempty intersection with each of the sets in the separation also fails to be 
connected. Moreover, the image under a continuous map of an interval of real numbers 
is connected. Thus a.topological space X is connected if for each pair of points u, ve X, 
there is a continuous map f: [0,1]—> X for which f(0) = u and f(1) = v. A topological 
space possessing this property is said to be arewise connected. While an arcwise-connected 
topological space is connected, there are connected spaces that fail to be arcwise connected 
(see Problem 49). However, for an open subset of a Euclidean space R", connectednes is 
equivalent to arcwise connectedness (see Problem 50). 


47. 


50. 


51. 
52. 
53. 


54. 


55. 


56. 


PROBLEMS 


Let {Cy} vcd be a collection of connected subsets of a topological space X and suppose that 
any two of them have a point in common. Show that the union of {Cy}, <q also is connected. 


. Let A be aconnected subset of a topological space X, and suppose A C BC A. Show that B is 


connected. 


. Show that the following subset of the plane is connected but not arcwise connected. 


- X={(x, y)| x=0, -l<y<1}U{(x, y)| y=sin1/x, 0<x<1}. 


Show that an arcwise connected topological space X is connected. Also show that each 
connected open subset O of a Euclidean space R" is arcwise connected. (Hint: Let x belong 
to O. Define C to be the set of points in O that can be connected in © to x by a piecewise 
linear arc. Show that C is both open and closed in O.) 


Consider the circle C = {(x, y) |x? + y* = 1} in the plane R2. Show that C is connected. 
Show that R” is connected. 


Show that a compact metric space (X, p) fails to be connected if and only if there are two 
disjoint, nonempty subsets A and B whose union is X and € > 0 such that p(u, v) > € for all 
u € A, ve B. Show that this is not necessarily the case for noncompact metric spaces. 


A metric space (X, p) is said to be well chained provided for each pair of points u, v€ X 
and each € > 0, there is a finite number of points in X, u = x9, X1,...,%n—1, Xn = v such that 
p(xi-1, x1) <¢€, forl <i<n. 

(1) Show that if X is connected, then it is well chained, but the converse is not true. 

(ii) Show that if X is compact and well chained, then it is connected. 


(iii) Show that if an open subset of R" is well chained, then it is connected. 


Show that for any point (x, y) in the plane R’, the subspace R? ~ {(x, y)} is connected. Use 
this to show that R is not homeomorphic to R. 


Verify the equivalence of the three assertions in (1). 


I 
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In the preceding chapter we considered several different topological concepts and examined 
relationships between these concepts. In this chapter we focus on three theorems in topology 
that, beyond their intrinsic interest, are indispensible tools in several areas of analysis. 


12.1 URYSOHN’S LEMMA AND THE TIETZE EXTENSION THEOREM 


On a metric space (X, p) there is an an abundance of continuous real-valued functions. 
Indeed, for a nonempty closed subset C of X, the function dc, called the distance to C and 
defined by 


dc(x) = inf (2, x) for all x € X, 


is continuous and C is the inverse image under dc of 0. Continuity follows from the triangle 
inequality. Moreover, if A and B are disjoint closed subsets of X, there is a continuous 
real-valued function f on X for which 


f(X) C[0, 1], f =00n A and f =1onB. 


The function f is given by 
da 
f= 
da +dp 
This explicit construction of f depends on the metric on X. However, the next fundamental 
lemma tells us that there exist such functions on any normal topological space and, in 
particular, on any compact Hausdorff space. 


on X. 


¥ 


Urysohn’s Lemma Let A and B be disjoint closed subsets of a normal topological space 
X. Then for any closed, bounded interval of real numbers [a, b], there is a continuous 
real-valued function f defined on X that takes values in [a, b], while f = a on A and 
f =boneB. 


This lemma may be considered to be an extension result: Indeed, define the real-valued 
function f on AU B by setting f = a on A and f = b on B. This is a continuous function 
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on the closed subset A U B of X which takes values in [a, b]. Urysohn’s Lemma asserts that 
this function can be extended to a continuous function on all of X which also takes values in 
[a, b]. We note that if a Tychonoff topological space X possesses the property described in 
Urysohn’s Lemma, then X must be normal. Indeed, for A and B nonempty disjoint closed 
subsets of X and a continuous real-valued function f on X that takes the value 0 on A and 1 
on B, if I; and Jy are disjoint open intervals containing 0 and 1, respectively, then f~!(J; ) 
and f~'(1,) are disjoint neighborhoods of A and B, respectively. 

The proof of Urysohn’s Lemma becomes clearer if we introduce the following concept 
and then establish two preliminary results. 


Definition Let X be a topological space and A a set of real numbers. A collection of open 
subsets of X indexed by A, {Oy}yc,, is said to be normally ascending provided for any 
Ay ,AE A, 

Or, CO, if ky < dp. 


Example Let f be a continuous real-valued function on the topological space X. Let A be 
any set of real numbers and define, for A € A, 


Oy ={xeX| f(x) <A}. 
By continuity it is clear that if A; < Ao, then 
On, C{xeX| f(x) <M}C{xex| f(x) <r} =O, 
and therefore the collection of open sets {O,}, <q is normally ascending. 
We leave the proof of the following lemma as an exercise. 


Lemma 1 Let X be a topological space. For A a dense subset of the open, bounded interval 
of real numbers (a, b), let {Oy}, <4 be a normally ascending collection of open subsets of X. 
Define the function f : X — R by setting f = bon X ~ Uye, Oy and otherwise setting 


f(x) =inf {AEA| xeOQ,}. (1) 
Then f: X > [a, b] is continuous. 
We next provide a strong generalization of Proposition 8 of the preceding chapter. 


Lemma2 Let X be anormal topological space, F a closed subset of X, and U aneighborhood 
of F. Then for any open, bounded interval (a, b), there is a dense subset A of (a, b) and a 
normally ascending collection of open subsets of X, {O,},< a, for which 


FCO, CO, CU forallrAe A. (2) 


Proof Since there is a strictly increasing continuous function of (0, 1) onto (a, b) we may 
assume that (a, b) = (0, 1). For the dense subset of (0, 1) we choose the set of dyadic 
rationals belonging to (0, 1): 


A = {m/2" | m and n natural numbers, 1 < m <2" -1}. 


I a ll Dh el ann: 
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For each natural number n, let A,, be the subset of A whose elements have denominator 
2". We will inductively define a sequence of collections of normally ascending open sets 
{Oa}aea,, Where each indexing is an extension of its predecessor. 


By Proposition 8 of the preceding chapter, we may choose an open set O1/7 for which 
FCO CO CU. 


Thus we have defined {Oy}, ¢ 4,. Now we use Proposition 8 twice more, first with F the same 
and U = O;/ and then with F = Oy, and U the same, to find open sets O1/4 and O3y4 for 
which 7 : - 

F C0474 © O14 © O12 € O12 € 03/4 C O3/4 CU. 


Thus we have extended the normally ascending collection {O)}) < 4,to the normally ascending 
collection {O)},<,,. It is now clear how to proceed inductively to define for each natural 
number n, the normally ascending collection of open sets {O,}, ¢,,. Observe that the union 
of this countable collection is a normally ascending collection of open sets parametrized by 
A, each of which is a neighborhood of F that has compact closure contained in U. CI 


Proof of Urysohn’s Lemma By Lemma 2, applied with F = A and U = X~B, we can 
choose a dense subset A of (a, b) and a normally ascending collection of open subsets of X, 
{Oy}, ca, for which 

ACO, CX~B for all AEA. 


Define the function f: X — [a, b] by setting f = b on X~ Uy, Oy and otherwise setting 
f(x) =inf {AEA| xeO,}. 
Then f =aon A and f = bon B. Lemma 1 tells us that f is continuous. L 


We mentioned above that Urysohn’s Lemma may be considered to be an extension 
result. We now use this lemma to prove a much stronger extension theorem. 


The Tietze Extension Theorem Let X be anormal topological space, F a closed subset of X, 
and f acontinuous real-valued function on F that takes values in the closed, bounded interval 
[a, b]. Then f has a continuous extension to all of X that also takes values in [a, b}. 


Proof Since the closed, bounded intervals [a, b] and [—1, 1] are homeomorphic, it is 
sufficient, and also convenient, to consider the case [a, b] = [-1, 1]. We proceed by 
constructing a sequence {g,,} of continuous real-valued functions on X that has the following 
two properties: for each index n, 


[gn| < (2/3)" on X (3) 
and 
lf —[g1 +--+ + 8n]l < (2/3)” on F. (4) 


Indeed, suppose, for the moment, that this sequence of functions has been constructed. 
Define, for each index n, the real-valued function s, on X by 


M= 


Sn(x) = > gn(x) for xin X. 


k=1 
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We infer from the estimate (3) that, for each x in X, {s,(x)} is a Cauchy sequence of real 
numbers. Since R is complete, this sequence converges. Define 


g(x)= hm Sn(x) for x in X. 


Since each g, is continuous on X, so is each s,. We may infer from the estimate (3) that {s,} 
converges to g uniformly on X and therefore g is continuous. From the estimate (4) it is clear 
that f = g on F. Thus the theorem is proved provided we construct the sequence {g,}. We 
do so by induction. 


Claim: For each a > 0 and continuous function h: F — R for which |h| < a on F, there is a 
continuous function g: X — R such that 
lg| < (2/3)a on X and |h — g| < (2/3)aon F. (5) 
Indeed, define 
A= {xin F | h(x) < —(1/3)a} and B= {xin F | h(x) > (1/3)a}. 


Since h is continuous on F and F is a closed subset of X, A and B are disjoint closed subsets 
of X. Therefore, by Urysohn’s Lemma, there is a continuous real-valued function g on X for 
which 

lg| < (1/3)a on X, g(A) = —(1/3)a and g(B) = (1/3)a. 


It is clear that (5) holds for this choice of g. Apply the above approximation claim with h = f 
and a = 1 to find a continuous function g,: X > R for which 


lg1| < (2/3) on X and | f — g1| < (2/3) on F. 


Now apply the claim once more with h = f — g; and a = 2/3 to find a continuous function 
g2: X > R for which 


gal < (2/3)* on X and | f — [g1 + g2]| < (2/3)? on F. 


It is now clear how to proceed to inductively choose the sequence {g,} which possesses 
properties (3) and (4). 


The Tietze Extension Theorem has a generalization to real-valued functions on X that 
are not necessarily bounded (see Problem 8). 

As a second application of Urysohn’s Lemma, we present the following necessary and 
sufficient criterion for the metrizability of a second countable topological space. 


The Urysohn Metrization Theorem Let X be a second countable topological space. Then X 
is metrizable if and only if it is normal. 


Proof We have already shown that a metric space is normal. Now let X be a second countable 
and normal topological space. Choose a countable base {U,}n¢N for the topology. Let A be 
the subset of the product N x N defined by 


A={(n, m) in NXN | Un CUm}. 
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Since X is normal, according to Urysohn’s Lemma, for each pair (n, m) in A, there is a 
continuous real-valued function f, m: X — [0, 1] for which 


| fam =00nU, and f, m = 1o0n X~Um. 
For x, yin X, define 


1 , : 

p(x, y)= > Fat bfn,m(x) — tnm(y)I- (6) 
(n,m)EA 

The set A is countable so this sum converges. It is not difficult to see that this is a metric. We 

claim that the topology induced by p is the given topology on X. To verify this it is necessary 

to compare bases. Specifically, it is necessary to verify the following two properties at each 

point xe X: 


(i) If, contains x, then there is an € > 0 for which B,(x, €) CUn. 
(ii) For each € > 0, there is a U,, that contains x and U,, C B,(x, €). 


We leave the verification of these assertions as an exercise. oO 
PROBLEMS 
1. Let C be a closed subset of a metric space (X, p). Show that the distance to C function dc} is 
continuous and dc(x) = 0 if and only if x belongs to C. 


2. Provide an example of a continuous real-valued function on the open interval (0, 1) that 
is not extendable to a continuous function on R. Does this contradict the Tietze Extension 
Theorem? 


3. Deduce Urysohn’s Lemma as a consequence of the Tietze Extension Theorem. 
4. State and prove a version of the Tietze Extension Theorem for functions with values in R”. 


5. Suppose that a topological space X has the property that every continuous, bounded real- 
valued function on a closed subset has a continuous extension to all of X. Show that if X is 
Tychonoff, then it is normal. 


6. Let (X, T) be a normal topological space and F the collection of continuous real-valued 
functions on X. Show that T is the weak topology induced by F. 


7. Show that the function p defined in the proof of the Urysohn Metrization Theorem is a metric 
that defines the same topology as the given topology. 


8. Let X be a normal topological space, F a closed subset of X, and f a continuous real-valued 
function on F. Then f has a continuous extension to a real-valued function f on all of X. 
Prove this as follows: 


(i) Apply the Tietze Extension Theorem to obtain a continuous extension h: X > [0, 1] of 
the function f -(1+ [f|])~!: F > [0, 1]; 


(ii) Once more, apply the Tietze Extension Theorem to obtain a function ¢: X > [0, 1] such 
that ¢ = 10n F and dé =0onh"1(1); 


_ (iii) Consider the function f = ¢-h/(1-—@-h). 
9. Show that a mapping f from a topological space X to a topological space Y is continuous if 


and only if there is a subbase S for the topology on Y such that the preimage under f of each 
set in S is open in X. Use this to show that if Y is a closed, bounded interval [a, b], then f 
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is continuous if and only if for each real number c € (a, b), the sets {x €X| f(x) <c} and 
{xe X | f(x) > c} are open. 


10. Use the preceding problem to prove Lemma 1. 


12.2 THE TYCHONOFF PRODUCT THEOREM 


For a collection of sets indexed by a set A, {X,},-a, we defined the Cartesian product 
IT, «aX, to be the collection of mappings from the index set A to the union U, <q X, such 
that each index A € A is mapped to a member of X,. For a member x of the Cartesian product 
and an index A€« A, it is customary to denote x(A) by x, and call x, the A-th component of 
x. For each parameter Ag € A, we define the Ag projection mapping 7, : yc aX, Xa, by 


T(x) = Xo for x ETT) ea Xa. 


We have defined the product metric on the Cartesian product of two metric spaces. 
This extends in an obvious manner to a metric on the Cartesian product of a finite number 
of metric spaces. Moreover, there is a natural metric on the Cartesian product of a countable 
number of metric spaces (see Problem 16). 

There is a natural definition of a topology on the Cartesian product of a finite collection 
of topological spaces. Given a collection {(X;, T;,)}"_, of topological spaces, the collection 
of products 

O1X... On ...XOn, 


where each O; belongs to T,, is a base for a topology on IT; <;<,Xx. The topology on the 
Cartesian product consisting of unions of these basic sets is called the product topology on 
Nh <p <nXk- 

What is novel for topological spaces is that a product topology can be defined on an 
arbitrary Cartesian product II, < 4X, of topological spaces. The index set is not required to 
be finite or even countable. 


Definition Let {(X,, T))},¢a be a collection of topological spaces indexed by a set A. The 
product topology on the Cartesian product 11, <,X, is the topology that has as a base sets of 
the form II, <¢4Qy, where each O, €T, and O) = X), except for finitely many A. 


If all the X,’s are the same space X, it is customary to denote Il,¢,4X, by X4. In 
particular, if N denotes the set on natural numbers, then X‘ is the collection of sequences in 
X while R% is the collection of real-valued functions that have domain X. If X is a metric 
space and A is countable, then the product topology on X“ is induced by a metric (see 
Problem 16). In general, if X is a metric space but A is uncountable, the product topology 
is not induced by a metric. For example, the product topology on R® is not induced by a 
metric (see Problem 17). We leave it as an exercise to verify the following two propositions. 


Proposition 3 Let X be a topological space. A sequence { f,: A—> X} converges to f in the 
product space X* if and only if { fy ( d)} converges to f(A) for each A in A. Thus, convergence 
of a sequence with respect to the product topology is pointwise convergence. 


Proposition 4 The product topology on the Cartesian product of topological spaces T1, < 4X 
is the weak topology associated to the collection of projections {7,: TexXa—>Xahrea, 


a aL 
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that is, it is the topology on the Cartesian product that has the fewest number of sets among the 
topologies for which all the projection mappings are continuous. 


The centerpiece of this section is the Tychonoff Product Theorem, which tells us that 
the product IT, <4 X, of compact topological spaces is compact. There are no restrictions on 
the index space A. In preparation for the proof of this theorem we first establish two lemmas 
regarding collections of sets that possess the finite intersection property. 


Lemma 5 Let A be a collection of subsets of a set X that possesses the finite intersection 
property. Then there is a collection B of subsets of X which contains A, has the finite 
intersection property, and is maximal with respect to this property; that is, no collection of 
subsets of X that properly contains B possesses the finite intersection property. 


Proof Consider the family F of all collections of subsets of X containing A and possessing 
the finite intersection property. Order ¥ by inclusion. Every linearly ordered subfamily Fo 
of F has an upper bound consisting of the sets belonging to any collection in Fp. According 
to Zorn’s Lemma, there is maximal member of 7. This maximal member is a collection of 
sets that has the properties described in the conclusion of the lemma. O 


Lemma 6 Let B be a collection of subsets of X that is maximal with respect to the finite 
intersection property. Then each intersection of a finite number of sets in B is again in B, and 
each subset of X that has nonempty intersection with each set in B is itself in B. 


Proof Let B’ be the collection of all sets that are finite intersections of sets in B. Then B’ 
is a collection having the finite intersection property and containing B. Thus B’ = B by the 
maximality, with respect to inclusion, of B. Now suppose that C is a subset of X that has 
nonempty intersection with each member of B. Since B contains each finite intersection of 
sets in B, it follows that B U {C} has the finite intersection property. By the maximality, with 
respect to inclusion, of B, B U {C} = B, and so C is a member of B. 


The Tychonoff Product Theorem Let {X,}, <4 be a collection of compact topological spaces 
indexed by a set A. Then the Cartesian product 11) < 4X, with the product topology, also is 
compact. 


Proof Let F be a collection of closed subsets of X = II,<¢,X, possessing the finite 
intersection property. We must show F has nonempty intersection. By Lemma 5, there is 
a collection B of (not necessarily closed) subsets of X that contains F and is maximal with 
respect to the finite intersection property. Fix A € A. Define 


By = {7,(B) | BeB}. 


Then B) is a collection of subsets of the set X, that has the finite intersection property, as 
does the collection of closures of members of B,. By the compactness of X, there is a point 


x, € X, for which 
xr€ (-) 7(B). 


BeB 


246 Chapter12 Topological Spaces: Three Fundamental Theorems 


Define x to be the point in X whose A-th coordinate is x,. We claim that 


xe () F. (7) 


FeF 


Indeed, the point x has the property that for each index A, x, is a point of closure of w,(B) 
for every Be B. Thus! 


every subbasic neighborhood AN, of x has nonempty intersection 
with every set B in B. (8) 


From the maximality of B and Lemma 6, we conclude that every subbasic neighborhood of 
x belongs to B. Once more using Lemma 6, we conclude that every basic neighborhood of x 
belongs to B. But B has the finite intersection property and contains the collection ¥. Let F 
be a set in F. Then every basic neighborhood of x has nonempty intersection with F. Hence 
x is a point of closure of the closed set F, so that x belongs to F. Thus (7) holds. O 


PROBLEMS 


11. Show that the product of an arbitrary collection of Tychonoff spaces, with the product 
topology, also is Tychonoff. 


12. Show that the product of an arbitrary collection of Hausdorff spaces, with the product 
topology, also is Hausdorff. 


13. Consider the Cartesian product of n copies of R, 


n 
r————- 
R’ = RXR*X..-XR. 


Show that the product topology is the same as the metric topology on R” induced by the 
Euclidean metric. 


14. Let (X, p,) and (Y, pz) be metric spaces. Show that the product topology on X X Y, where 
X and X have the topologies induced by their respective metrics, is the same as the topology 
induced by the product metric 


p((x1, 1), (¥2, ¥2)) = Ver (x1, x2)? + [o2(v1, 2). 


15. Show that if X is a metric space with metric p, then 


_ _ P(x, y) 
1+ p(x, y) 


also is a metric on X and it induces the same topology as the metric p. 


p*(x, y) 


16. Consider the countable collection of metric spaces {(Xn, pn )}°°,. For the Cartesian product 
of these sets X = TT], Xn, define p: X XX > R by 


= S$ Pala Yn) 
p(x, y) = 2 2"[1 + pn(Xn,yn)] 


'Tt is convenient here to call an open set O set of the form O = Il, ¢ ,O),, where each Q) is an open subset of 
X, and O, = X, except for one A, a subbasic set and the finite intersection of such sets a basic set. 
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Use the preceding problem to show that p is a metric on X = [[°°, X, which induces the 


product topology on X, where each X,, has the topology induced by the metric pp. 

17. Consider the set X = R® with the product topology. Let E be the subset of X consisting of - 
functions that take the value 0 on a countable set and elsewhere take the value 1. Let fo be — 
the function that is identically zero. Then it is clear that fo is a point of closure of E. But 
there is no sequence { f,} in E that converges to fo, since for any sequence { f,} in E there is 
some xo € R such that fn(x0) = = 1 for all n and so the sequence { f,(xo )} does not converge 
to fo(xo). This shows, in particular, that X = RE i is not first countable and therefore not 
metrizable. 


18. Let X denote the discrete topological space with two elements. Show that X" ishomeomorphic 
to the Cantor set. 


19. Using the Tychonoff Product Theorem and the compactness of each closed, bounded interval 
of real numbers prove that any closed, bounded subset of R” is compact. 


20. Provide a direct proof of the assertion that if X is compact and / is a closed, bounded interval, 
then X X I is compact. (Hint: Let U/ be an open covering of X X J, and consider the smallest 
value of t € J such that for each r’ < t the set X x [0, r’] can be covered by a finite number of 
sets in U. Use the compactness of X to show that X x [0, t] can also be covered by a finite 
number of sets in U/ and that if t <1, then for some r” >t, X x[0, t’] can be covered by a finite 
number of sets in U.) 


21. Prove that the product of a countable number of sequentially compact topological spaces is 
sequentially compact. 


22. A product J“ of unit intervals is called a (generalized) cube. Prove that every compact 
Hausdorff space X is homeomorphic to a closed subset of some cube. (Let F be the family 
of continuous real-valued functions on X with values in [0, 1]. Let Q = ye #/,. Then, since 
X is normal, the mapping g of X onto Q that takes x into the point whose f-th coordinate 1 1S 

f(x) is one-to-one, continuous, and has closed image.) _ 


23. Let Q = I4 be acube, and let f be a continuous real-valued function on Q. Then, given € > 0, 
there is a continuous real-valued function g on Q for which | f — g| < € and g is a function 
of only a finite number of coordinates. (Hint: Cover the range of f by a finite number of 
intervals of length € and look at the inverse images of these intervals.) 


12.3 THE STONE-WEIERSTRASS THEOREM 


The following theorem is one of the jewels of classical analysis. 


The Weierstrass Approximation Theorem Let f be a continuous real-valued function on a 
closed, bounded interval {a, b]. Then for each € > 0, there is a polynomial p for which 


| f(x) — p(x)| < for all x € [a, Db]. 


In this section we provide a far-reaching extension of this theorem. For a compact 
Hausdorff space X, consider the linear space C(X) of continuous real-valued functions 
on X with the maximum norm. The Weierstrass Approximation Theorem tells us that the 
polynomials are dense in C[a, D]. 

Now C(X) has a product structure not possessed by all linear spaces, namely, the 
product fg of two functions f and gin C(X) is again in C(X). A linear subspace A of C(X) 


248 Chapter12° Topological Spaces: Three Fundamental Theorems 


is called an algebra provided the product of any two functions in A also belongs to A. A 
collection A of real-valued functions on X is said to separate points in X provided for any 
two distinct points u and v in X, there is an f in A for which f(u)# f(v). Observe that since 
X is compact and Hausdorff, according to Theorem 18 of the preceding chapter, it is normal 
and therefore we may infer from Urysohn’s Lemma that the whole algebra C(X) separates 
points in X. 


The Stone-Weierstrass Approximation Theorem Let X be a compact Hausdorff space. 
Suppose A is an algebra of continuous real-valued functions on X that separates points in X 
and contains the constant functions. Then A is dense in C(X). 


Observe that this is a generalization of the Weierstrass Approximation Theorem 
since the closed, bounded interval [a, b] is compact and Hausdorff and the collection of 
polynomials is an algebra that contains the constant functions and separates points. 

Before we prove the theorem, a few words concerning strategy are in order.” Since X is 
compact and Hausdorff, it is normal. We infer from Urysohn’s Lemma that for each pair of 
disjoint closed subsets A and B of X ande €(0, 1/2), there is a function f € C(X) for which 


f=e¢/2onA, f =1—6¢/20nB, ande/2 < f <1-6€/2onX. 
Therefore, if |h — f| <«/20n X, 
h<eonA,h>1-—eonB, and0<h<1lonXx. (9) 


The proof will proceed in two steps. First, we show that for each pair of disjoint closed 
subsets A and B of X and « € (0, 1/2), there is a function h belonging to the algebra A for 
which (9) holds. We then show that any function f in C(X) can be uniformly approximated 
by linear combinations of such h’s. 


Lemma7 Let X be a compact Hausdorff space and A an algebra of continuous functions on 
X that separates points and contains the constant functions. Then for each closed subset F of 
X and point xo belonging to X ~ F, there is a neighborhood U of xq that is disjoint from F 
and has the following property: for each « > 0, there is a function h € A for which 


h<eonU, h>1-eonF, and0<h<1lonXx. (10) 
Proof We first claim that for each point y € F, there is a function gy in A for which 
8y(x0) = 0, gy(y) > 0, and0 < gy, <1onX. (11) 
Indeed, since A separates points, there is a function f ¢A for which f(x0) # f(y). The 
function ; 
_| f-Ff(%0) 
Syl, +67. 
lf — F(%0)Ilmax 


2The proof we present is due to B. Brasowski and F. Deutsch, Proceedings of the American Mathematical Society, 
81 (1981). Many very different-looking proofs of the Stone-Weierstrass Theorem have been given since the first 
proof in 1937 by Marshal Stone. 
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belongs to A and satisfies (11). By the continuity of g,, there is a neighborhood N y Of yon 
which g, only takes positive values. However, F is a closed subset of the compact space X and 
therefore F itself is compact. Thus we may choose a finite collection of these neighborhoods 
{Ny,,.--,My,} that covers F. Define the function g € A by 


1 
B= 7 28 


Then | : 
g(xo) =0, g>O0onF, and0 <g<1lonX. (12) 


But a continuous function on a compact set takes a minimum value, so we may choose c > 0 
for which g > c on F. By possibly multiplying g by a positive number, we may suppose c < 1. 
On the other hand, g is continuous at xo, so there is a neighborhood U/ of xo for which g < c/2 
on U. Thus g belongs to the algebra A and 


g<c/2onu, g>conF, and0<g<1onX. (13) 


We claim that (10) holds for this choice of neighborhood U. Let ¢ > 0. By the Weierstrass 
Approximation Theorem, we can find a polynomial p such that? 


p<eon (0, c/2], p>1-—eon|[c, 1], and0 < p<1on[0, 1]. (14) 


Since p is a polynomial and f belongs to the algebra A, the composition h = po g also 
belongs to A. From (13) and (14) we conclude that (10) holds. O 


Lemma 8 Let X be a compact Hausdorff space and A an algebra of continuous functions 
on X that separates points and contains the constant functions. Then for each pair of disjoint 
closed subsets A and B of X and € > 0, there is a function h belonging to A for which 


h<eonA, h>1-—eonB, and0<h<1onXx. (15). 


Proof By the preceding lemma in the case F = B, for each point xe A, there is a 
neighborhood A’, of x that is disjoint from B and has the property (10). However, A is 
compact since it is a closed subset of the compact space X, and therefore there is a finite 
collection of neighborhoods {NV x>-+-, W,,} that covers A. Choose €p for which 0 < ey <« 
and (1 —¢9/n)" >1—e. For 1 <i <n, since N,, has property (10) with B = F, we choose 
h; € A such that | | | 


hi <€o/non N,,, hj >1—€9/n on B, and 0 <h; <10nX. 
Define | 
h=h,-hj---h, on X. 


Then h belongs to the algebra A. Since for each i, 0 < h; < 1 on X, we have O0<h <1onX. 
Also, for each i, hj > 1— €9/n on B,soh > (1—€9/n)" >1-—eonB. Finally, for each point 
xin A there is an index i for which x belongs to N’,,. Thus h;(x) <¢g/n < and since for the 
other indices j,0 < h(x) < 1, we conclude that h(x) <e. | L 


Rather than using the Weierstrass Approximation Theorem here, one can show that (14) holds for a polynomial 
of the form p(x) = 1—(1— x"), where n and m are suitably chosen natural numbers. 
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Proof of the Stone-Weierstrass Theorem Let f belong to C(X). Set c = || fIlmax. If we can 
arbitrarily closely uniformly approximate the function 

_ Ste 

Hf + cl|max 


by functions in A, we can do the same for f. Therefore we may assume that 0 < f <1onX. 
Let n > 1 be a natural number. Consider the uniform partition {0, 1/n,2/n,...,(a—1)/n, 1} 
of [0, 1] into n intervals, each of length 1/n. Fix j, 1 < j <n. Define 


Aj = {xinX| f(x) <(j—-1)/n} and Bj = {xin X| f(x) > j/n}. 
Since f is continuous, both A; and B; are closed subsets of X and, of course, they are disjoint. 
By the preceding lemma, with A = A;, B = Bj, and € = 1/n, there is a function g; in the 
algebra A for which 


8j(x) <1/nif f(x) < (j-1)/n, gj(x)>1-1/nif f(x) > j/n and0 < gj <10nX. (16) 


Define 2 
g=— dg). 
nay ] 
Then g belongs to A. We claim that 
If — gllmax < 3/n. (17) 


Once we establish this claim the proof is complete since, given € > 0, we simply select n such 
that 3/n < « and therefore || f — g||max < €. To verify (17), we first show that 


if1 <k <nand f(x) <k/n, then g(x) <k/n+1/n. (18) 


Indeed, for j =k+1,...,n, since f(x) <k/n, f(x) < (j-1)/n and therefore g(x) < 1/n. 
Thus 


Consequently, since each g;(x) < 1, for all j, 


1 1 ¥ 1 1 J 
a(x)=—Sai=—-Dait- DS gis-—Dati/nsk/n+1/n. 
a MN jxl M txk4+1 M j=] 
Thus (18) holds. A similar argument shows that 
if1<k <nand(k—1)/n < f(x), then (k—1)/n—1/n < g(x). (19) 


For x € X, choose k, 1 < k <n, such that (k —1)/n < f(x) < k/n. From (18) and (19) we 
infer that | f(x) — g(x)| <3/n. O 
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We conclude this chapter with the following elegant consequence of Urysohn’ s Lemma 
and the Stone-Weierstrass Theorem. 


Riesz’s Theorem Let X be a compact Hausforff topological space. Then C(X) is separable 
if and only if X is metrizable. 


Proof First assume X is metrizable. Let p be a metric that induces the topology on X. Then 
X, being a compact metric space, is separable. Choose a countable dense subset {x,} of X. For 
each natural number n, define f,(x) = p(x, x,) for all x € X. Since p induces the topology, 
fn 1s continuous. We infer from the denseness of {x,,} that { f,} separates points in X. Define | 
fo=1 on X. Now let A to be the collection of polynomials, with real coefficients, in a finite 
number of the f;,0 < k < oo. Then A is an algebra that contains the constant functions, 
and it separates points in X since it contains each f,. According to the Stone-Weierstrass 
Theorem, A is dense in C(X). But the collection of functions f in A that are polynomials 
with rational coefficients is a countable set that is dense in A. Therefore C(X) is separable. 


Conversely, suppose C(X) is separable. Let {g,} be a countable dense subset of 
C(X). For each natural number n, define O, = {x € X | gn(x) > 1/2}. Then (On}1 <n<oo iS a 
countable collection of open sets. We claim that every open set is the union of a subcollection 
of {On}i<n<oo, and therefore X is second countable. But X is normal, since it is compact and 
Hausdorff. The Urysohn Metrization Theorem tells us that X is metrizable. To verify second 
countability, let the point x belong to the open set O. Since X is normal, there is an open set 
U for which x € U CU C O. By Urysohn’s Lemma, there is a g in C(X) such that g(x) =1 
on U and g = 0 on X~O. By the denseness of {g,} in C(X), there is a natural number n for 
which |g — g,| < 1/2 on X. Therefore x € O, C O. This completes the proof. a 


PROBLEMS 


24. Suppose that X is a topological space for which there is a collection of continuous real-valued 
functions on X that separates points in X. Show that X must be Hausdorff. 


25. Let X be a compact Hausdorff space and A C C(X) an algebra that contains the constant 
functions. Show that A is dense in C(X) if and only if A separates points in X. 


26. Let A be an algebra of continuous real-valued functions on a compact space X that contains 
the constant functions. Let f € C(X) have the property that for some constant function c and 
real number a, the function a( f + c) belongs to A. Show that f also belongs to A. 


27. For f, g€C|a, b], show that f = g if and only if J x" f(x) dx = f° x" g(x) dx for all n. 
28. For f € Cla, b] and € > 0, show that there are real numbers co, c1,..., ¢, for which 


on 
If(x) —co— S ce | < efor all x€[a, d]. 
k=1 


29. For f €C[0, aw] and € > 0, show that there are real numbers co, c1, ... , C, for which 


n . 
If(x) —co — S) ce coskx| <€ for all x €[0, a]. 
k=l 
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30. 


31. 


32. 


33. 


35. 


Let f be a continuous real-valued function on R that is periodic with period 27. For « > 0, 
show that there are real numbers, co, a}, ..., dn, b;,..., b,, such that 


n 
| f(x) — co — S [ax cos kx + by sin kx]| < € for all x eR. 
k=1 


(Hint: A periodic function may be identified with a continuous function on the unit circle in 
the plane and the unit circle is compact and Hausdorff with the topology it inherits from the 
plane.) 


Let X and Y¥ be compact Hausdorff spaces and f belong to C(X x Y). Show that for each 
¢ > 0, there are functions fi,..., fr, in C(X) and g),..., g, in C(Y) such that 


If(x,y) - ») f(x) - ax(y)| < for all (x, y) eX XY. 
k=1 


Rather than use the Weierstrass Approximation Theorem in the proof of the Stone- 
Weierstrass Theorem, show that there are natural numbers m and n for which the polynomial 
p(x) =1—(1-— x") satisfies (14). (Hint: Since p(0) = 0, p(1) =1 and p’ > 0 on (0, 1), it 
suffices to choose m and n such that p(c/2) < and p(c) >1-€.) 

Let A be a collection of continuous real-valued functions on a compact Hausdorff space X that 


separates the points of X. Show that every continuous real-valued function on X can be uni- 
formly approximated arbitrarily closely by a polynomial in a finite number of functions of A. 


. Let A be an algebra of continuous real-valued functions on a compact Hausdorff space X. 


Show that the closure of A, A, also is an algebra. 


Let A be an algebra of continuous real-valued functions on a compact Hausdorff space 
X that separates points. Show that either A = C(X) or there is a point x9 € X for which 
A={fe C(X) | f(xo) = 0}. (Hint: If 1 € A, we are done. Moreover, if for each x € X there is 
an f € A with f(x) #0, then there is a g € A that is positive on X and this implies that 1 €¢ 4.) 


. Let X be a compact Hausdorff space and A an algebra of continuous functions on X that 


separates points and contains the constant functions. 
(i) Given any two numbers a and b and points u, v € X, show that there is a function f in A 
for which f(w) =a and f(v) =b. 


(ii) Is it the case that given any two numbers a and b and disjoint closed subsets A and B of 
X, there is a function f in A for which f = aon A and f = bon B? 
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We have already examined important specific classes of normed linear spaces. The most 
prominent of these are: (i) for a natural number n, Euclidean space R”; (ii) for a Lebesgue 
measurable subset E of real numbers and 1 < p< ov, the L?(E) space of Lebesgue 
measurable functions for which the pth power is integrable over E; (ili) for X a compact . 
topological space, the linear space C( X) of continuous real-valued functions on X, normed 
by the maximum norm. In this and the following three chapters we study general normed 
linear spaces and the continuous linear operators between such spaces. The most interesting 
results are obtained for complete normed linear spaces, which we call Banach spaces. The 
results we have established in the preceding four chapters for metric and topological spaces 
are our basic tools. © 


13.1 NORMED LINEAR SPACES 


A linear space X is an abelian group with the group operation of addition denoted by +, for 
which, given a real number a andu € X, there is defined the scalar product a-u € X for which 
the following three properties hold: for real numbers a and 6 and members u and v in X, 


(a+ B)-u=a-ut+B-u, 
a-(u+tv)=a-ut+a-v, 
(aB)-u=a-(B-u)and1l-u=u. 


A linear space is also called a vector space and, paying respect to R”, members of a linear 
space are often called vectors. The quintessential example of a linear space is the collection of 
real-valued functions on an arbitrary nonempty set D where, for two functions f, g: D> R 
and real number A, addition f+g and scalar multiplication A- f are defined pointwise on D by 


(f +e)(x) = f(x) + g(x) and (A- f)(x) =Af(x) for all x € D. 
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Recall the concept of norm on a linear space X, which we first studied in Chapter 7; a 
nonnegative real-valued function || - || defined on a linear space X is called a norm provided 
for allu,v € X anda eR: 

||u|| = 0 if and only if u = 0; 


lu + ull < [lull + lvl; 
law || = |a||]u]]. 


As we observed in Chapter 9, a norm on a linear space induces a metric on the space, where 
the distance between u and v is defined to be ||u — v|]. When we refer to metric properties of 
a normed space, such as boundedness and completeness, we mean with respect to the metric 
induced by the norm. Similarly, when we refer to topological properties, such as a sequence 
converging or a set being open, closed, or compact, we are referring to the topology induced 
by the above metric.! 


Definition Two norms || - ||; and || - ||z on a linear space X are said to be equivalent provided 
there are constants c, > 0 and cz > 0 for which 


cy + xlly S [lxll2 <2 - alls forall x € X. 


We immediately see that two norms are equivalent if and only if their induced metrics are 
equivalent. Therefore, if a norm on a linear space is replaced by an equivalent norm, the 
topological and metric properties remain unchanged. 

Concepts from linear algebra in finite dimensional spaces are also important for general 


linear spaces.” Given vectors x1,...,x, in a linear space X and real numbers Aj,..., An, 
the vector 
n 
x= » AKXk 
k=1 


is called a linear combination of the x;’s. A nonempty subset Y of X is called a linear subspace, 
or simply a subspace, provided every linear combination of vectors in Y also belongs to Y. 

For anonempty subset S of X, by the span of 5 we mean the set of all linear combinations 
of vectors in S: we denote the span of S by span[S]. We leave it as an exercise to show that 
span[S] is a linear subspace of X, which is the smallest subspace of X that contains S in the 
sense that it is contained in any linear subspace that contains S. If Y = span[S] we say that 
S spans Y. It will also be useful to consider the closure of the span of S, which we denote 
by span[S]. We leave it as an exercise to show that the closure of a linear subspace of X is 
a linear subspace. Thus span[S] is a linear subspace of X which is the smallest closed linear 
subspace of X that contains S in the sense that it is contained in any closed linear subspace 
that contains S. We call span[5] the closed linear span of 5. 


1Tn the following chapters we consider topologies on a normed linear space X other than that induced by the 
norm and are explicit when we refer to topological properties with respect to these other topologies. 

*We later refer to a few results from linear algebra but require nothing more than knowing that any two bases of 
a finite dimensional linear space have the same number of vectors, so dimension is properly defined, and that any 
linearly independent set of vectors in a finite dimensional linear space is a subset of a basis: see Peter Lax’s Linear 
Algebra [Lax97]. 
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For any two nonempty subsets A and B of a linear space X, we define the sum of A 
with B, written A + B, by | 7 


A+B={x+y| xeA,ye B}. 


In the case Bis the singleton set {xo}, we denote A + {xo} by A+ xo and call this set a translate 
of A. For A € R, we define AA to be the set of all elements of the form Ax with x ¢€ A. 
Observe that if Y and Z are subspaces of X, then the sum Y + Z is also a subspace of X. In 
the case YN Z = {0} we denote Y + Z by Y © Z and call this subspace of X the direct sum of 
Y and Z. 

For a normed linear space X, the open ball of radius 1 centered at the origin, 
{x € X | ||x|| <1}, is called the open unit ball in X and {x € X | ||x|| < 1} 1s called the closed 
unit ball in X. We call a vector x € X for which ||x|| = 1 a unit vector. | 

Almost all the important theorems for metric spaces require completeness. Therefore 
it is not surprising that among normed linear spaces those that are complete with respect to 
the metric induced by the norm will be the most important. 


Definition A normed linear space is called a Banach space provided it is complete as a metric 
space with the metric induced by the norm. 


The Riesz-Fischer Theorem tells us that for E a measurable set of real numbers 
and 1 < p < oo, L?(E) is a Banach space. We also proved that for X a compact 
topological space, C(X), with the maximum norm, is a Banach space. Of course, we 
infer from the Completeness Axiom for R that each Euclidean space R” is a Banach 
space. 


PROBLEMS 


1. Show that a nonempty subset S of a linear space X is a subspace if and onlyifS+S=S and 
A-S=S foreach A € R,A#0. © 


2. If Y and Z are subspaces of the linear space X, show that Y + Z also is a subspace and 
Y + Z =spanlY U Z]. 
3. Let S be a subset of a normed linear space X. ? 
(i) Show that the intersection of a collection of linear subspaces of X also is a linear subspace 
of X. | 


(ii) Show that span[S] is the intersection of all the linear subspaces of X that contain S and 
therefore is a linear subspace of X. 


(iii) Show that span[S] is the intersection of all the closed linear subspaces of X that contain 
S and therefore is a closed linear subspace of X. 


4. For anormed linear space X, show that the function || - ||: X > R is continuous. 


5. For two normed linear spaces (X, || - ||, ) and (Y, || - ll2), define a linear structure on the 
Cartesian product X XY by A- (x, y) = (Ax, Ay) and (x1, y1)+(x2, yo) = (m1 +22, yi t+y2). 
Define the product norm || - || by ||(x, y)|| = llxll1 + llyllz, for x € X and y € Y. Show that this 
is a norm with respect to which a sequence converges if and only if each of the two component 
sequences converges. Furthermore, show that if X and Y are Banach spaces, then so is X XY. 
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6. Let X be a normed linear space. 


(i) Let {x,} and {y,} be sequences in X such that {x,} — x and {y,} > y. Show that for any 
real numbers a and B, {ax, + Byn} > ax + By. 


(ii) Use (i) to show that if Y is a subspace of X, then its closure Y also is a linear subspace of 


(iii) Use (i) to show that the vector sum is continuous from X X X to X and scalar multiplication 
is continuous from RX X to X. 


7. Show that the set P of all polynomials on [a, b] is a linear space. For P considered as a subset 
of the normed linear space C[a, b], show that P fails to be closed. For P considered as a 
subset of the normed linear space L'[a, b], show that P fails to be closed. 


8. A nonnegative real-valued function || - || defined on a vector space X is called a pseudonorm 
if ||x + yll < Ilxil + lly] and |jax|| = |a| ||x|]. Define x = y, provided ||x — y|| = 0. Show that 
this is an equivalence relation. Define X/~ to be the set of equivalence classes of X under = 
and for x € X define [x] to be the equivalence class of x. Show that X/~ is a normed vector 
space if we define a[x] + B[y] to be the equivalence class of ax + By and define |j[x]|| = ||xl. 
Illustrate this procedure with X = L?[a, b],1 < p< ov. 


13.2 LINEAR OPERATORS 


Definition Let X and Y be linear spaces. A mapping T: X — Y is said to be linear provided 
for each u, v € X, and real numbers a and B, 


T(au + Bv) = aT(u)+BT(v). 


Linear mappings are often called linear operators or linear transformations. In linear 
algebra one studies linear operators between finite dimensional linear spaces, which, with 
respect to a choice of bases for the domain and range, are all given by matrix multiplication. 
In our study of the L?(E) spaces for 1 < p < oo, we considered continuous linear operators 
from L? to R. We called these operators functionals and proved the Riesz Representation 
Theorem that characterized them. 


Definition Let X and Y be normed linear spaces. A linear operator T: X — Y is said to be 
bounded provided there is a constant M > 0 for which 


| (u)Il < Mull for all u € X. (1) 


The infimum of all such M is called the operator norm of T and denoted by ||T ||. The collection 
of bounded linear operators from X to Y is denoted by L(X, Y). 


Let X and Y be normed linear spaces and T belong to £(X, Y). It is easy to see that 
(1) holds for M = ||T||. Hence, by the linearity of T, 


IT(u) —T(v)l] < ITI - lw — vl for all w, v € X. (2) 


Section 13.2. Linear Operators 257 
From this we infer the following continuity property of a bounded linear operator T : 


if {u,} > uin X, then {T(u,)} > T(u) in Y. (3) 
Indeed, we have the following basic result for linear operators. 


Theorem 1 A linear operator between normed linear spaces is continuous if and only if it 
is bounded. | 


Proof Let X and Y be normed linear spaces and T: X > Y be linear. If T is bounded, 
(3) tells us that T is continuous. Now suppose T: X — Y is continuous. Since T is linear, 
T(0) = 0. Therefore, by the ¢ — 6 criterion for continuity at vu = 0, with e = 1, we may choose 
5 > 0 such that ||T(u) — T(0)|| <1 if lu — Ol] < 6, that is, ||7(u)|| < 1if |lul| < 6. For any 
u € X,u#0, set A = 8/||u|| and observe by the positive homogeneity of the norm, ||Au|| < 6. 
Thus |/7(Au)|| <1. Since ||T(Au)|| = Al[T(u)|], we conclude that (1) holds for M = 1/5. U 


Definition Let X and Y be linear spaces. For T: X — Y and S: X — Y linear operators and 
real numbers a, B we define aT + BS: X — Y pointwise by | 


(aT + BS)(u) = aT(u) + BS(u) for allu € X. | (4) 


Under pointwise scalar multiplication and addition the collection of linear operators between 
two linear spaces is a linear space. | 


Proposition 2 Let X and Y be normed linear space. Then the collection of bounded linear 
operators from X to Y, £L(X, Y), is anormed linear space. 


Proof Let 7 and S belong to £(X, Y). We infer from the triangle inequality for the norm 
on Y and (2) that 


(T+ S)(w) <T(w) + US(e dS TMM] + PSH Mell = CTT + USI) ilel for all u € X. 


Therefore T + S is bounded and ||T + S|] < ||T|| + ||Sl]. It is clear that for a real number a, 
aT is bounded and ||a7|| = |a|||7|| and ||7|| = 0 if and only if T(uv) = 0 for allu € X. L 


Theorem 3 Let X and Y be normed linear spaces. If Y is a Banach space, then so is £L(X, Y). 


Proof Let {T,,} be a Cauchy sequence in £(X, Y). Let u belong to X. Then, by (2), for all. 
indices n and m, 


Zn(u) — Tm(u )Il = (Tn — Tn ull < In — Tmt - [lel 


Thus {T,,(u)} is a Cauchy sequence in Y. Since, by assumption, Y is complete, the sequence 
{T,(u)} converges to a member of Y, which we denote by T(u). This defines a mapping 
T: X — Y. We must show T belongs to £(X, Y) and {T,} > T in £(X, Y). To establish 
linearity observe that for each uy, u2 in X, since each T, is linear, 


P(ur) + Tua) = lim Tr(u1) + lim Ta(u2) = lim Tawa + 2) = T(ui + v2), 
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and similarly T(Auw) = AT(u). 
We establish the boundedness of T and the convergence of {T,} to T in £(X, Y) 


simultaneously. Let € > 0. Choose an index N such that for all n > N,k > 1, ||Tn-Th+x||</2. 
Thus, by (2), for all u € X, 


Tn (u) — Tre(u )Il = (Tn — Troe ull < Tn — Tr+ell + lull < €/2Ilul. 
Fix n > N and u € X. Since limy-5.0 Tn44(u) = T(u) and the norm is continuous we 
conclude that 
II7,(u) — T(u)|| < €/2llul. 
In particular, the linear operator Ty — T is bounded and therefore, since Ty also is bounded, 
so is T. Moreover, ||7,, — T|| < € forn > N. Thus {T,} > T in £(X, Y). LJ 


For two normed linear spaces X and Y, an operator T € £(X, Y) is called an 
isomorphism provided it is one-to-one, onto, and has a continuous inverse. For T in 
£(X, Y), if it is one-to-one and onto, its inverse is linear. To be an isomorphism requires 
that the inverse be bounded, that is, the inverse belong to L(Y, X). Two normed linear 
spaces are said to be isomorphic provided there is an isomorphism between them. This is an 
equivalence relation that plays the same role for normed linear spaces that homeomorphism 
plays for topological spaces. An isomorphism that also preserves the norm is called an 
isometric isomorphism: it is an isomorphism that is also an isometry of the metric structures 
associated with the norms. 

For a linear operator T: X — Y, the subspace of X, {x € X |T(x) = 0}, is called the 
kernel of 7 and denoted by ker 7. Observe that T is one-to-one if and only if ker T = {0}. 
We denote the image of T, T(X), by Im T. 


PROBLEMS 
9. Let X and Y be normed linear spaces and T: X —> Y be linear. 
(i) Show that T is continuous if and only if it is continuous at a single point ug in X. 


(ii) Show that T is Lipschitz if and only if it is continuous. 
(iii) Show that neither (i) nor (ii) hold in the absense of the linearity assumption on T. 


10. For X and Y normed linear spaces and T € L£(X, Y), show that ||7'|| is the smallest Lipschitz 
constant for the mapping 7, that is, the smallest number c > 0 for which 


|T(u) — T(v)|| <c- lu — vl for all u, v € X. 


11. For X and Y normed linear spaces and T € £(X, Y), show that 
Tl] =sup {IT(u) | «eX, ull <1}. 
12. For X and Y normed linear spaces, let {T,} > T in £(X, Y) and {u,} — u in X. Show that 
{T,(un)} > T(u) in Y. 
13. Let X be a Banach space and T € £(X, X) have ||T|| < 1. 


(i) Use the Contraction Mapping Principle to show that J— T € £(X, X) is one-to-one and 
onto. 


(ii) Show that J — T is an isomorphism. 


14. 


15. 


16. 


17. 


18. 
19. 


20. 


2. 


22. 
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(Neumann Series) Let X be a Banach space and T € £(X, X) have ||T|| < 1. Define T =Id. 
(i) Use the completeness of £(X, X) to show that S T”" converges in £(X, X). 
n=0 


10,@) 
(ii) Show that (7—-T)-! = 5 7". 
n=0 
For X and Y normed linear spaces and T € £(X, Y), show that T is an isomporphism if and 
only if there is an operator S € £(Y, X) such that for each u € X andve ¥, | 


S(T(u)) =uand T(S(v)) =v. 


For X and Y normed linear spaces and T € £(X, Y), show that ker T is a closed subspace of 
X and that T is one-to-one if and only if ker T = {0}. 


Let (X, p) be a metric space containing the point xo. Define Lipp(X) to be the set of 
real-valued Lipschitz functions f on X that vanish at xo. Show that Lipo(X) is a linear space 
that is normed by defining, for f € Lip,(X), 


If(x) -— FO) 
; . 


ll fl] = su 
| ! * x,y) 


Show that Lip)(X) is a Banach space. For each x € X, define the linear functional F, 
on Lipg(X) by setting F,(f) = f(x). Show that F, belongs to L(Lipy(X), R) and that 
for x, y € X, ||Fx — Fyll = p(x, y). Thus X is isometric to a subset of the Banach space 
£(Lipp(X), R). Since any closed subset of a complete metric space is complete, this provides 
another proof of the existence of a completion for any metric space X. It also shows that any 
metric space is isometric to a subset of a normed linear space. 


Use the preceding problem to show that every normed linear T space is a dense subspace of a 


Banach space. 


For X a normed linear space and T,S € L(X, X), show that the composition So T also 
belongs to £(X, X) and |S oT] < ISI] - ITI. 


Let X be a normed linear space and Y a closed linear subspace of X. Show that 
lIxll1 = inf yey |x — yl] defines a pseudonorm on X. The normed linear space induced by 
the pseudonorm || - ||; (see Problem 8) is denoted by X/Y and called the quotient space of X 
modulo Y. Show that the natural map ¢ of X onto X/Y takes open sets into open sets. 


Show that if X is a Banach space and Y a closed linear subspace of X, then the quotient X/Y 
also is a Banach space and the natural map g: X > X/Y has norm 1. 


Let X and Y be normed linear spaces, T € £(X, Y) and ker T = Z. Show that there i is a 
unique bounded linear operator S from X/Z into Y such that T = So g where g: X > X/Z 
is the natural map. Moreover, show that ||T |} = || 5]. 


13.3 COMPACTNESS LOST: INFINITE DIMENSIONAL NORMED LINEAR SPACES 


A linear space X is said to be finite dimesisional provided there is a subset {e1, ..., én} of X 
that spans X. If no proper subset also spans, X ; we call the set {e1, ..., én} a basis for X and call 
n the dimension of X. If X is not spanned by a finite collection of vectors it is said to be infinite 
dimensional. Observe that a basis {e;>.{2., e,} for X is linearly independent in the sense that 


if S xje; #0, then x; = 0 for all 1 <i <n, 
i=. 
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for otherwise a proper subset of {e),..., e,} would span X. 
Theorem 4 Any two norms on a finite dimensional linear space are equivalent. 


Proof Since equivalence of norms is an equivalence relation on the set of norms on X, it 
suffices to select a particular norm || - ||, on X and show that any norm on X is equivalent to 
| - |,. Let dim X = n and {e,,...,e,} be a basis for X. For any x = xje@y +... + Xe, € X, 


define 
xl, = x2 +... 422. 
Since the Euclidean norm is a norm on R", || - ||, is anorm on X. 
Let || - || be any norm on X. We claim it is equivalent to || - ||,. First we find ac; > 0 
for which 


llxl] < cy - |x|], for all x € X. (5) 
Indeed, for x = xye; +...+Xne, € X, by the subadditivity and positive homogeneity of the 


norm || - ||, together with the Cauchy-Schwarz inequality on R”, 


n 
Ix < > ville <u> 


i=] i=] 


n 


|x;| = M./n|x||,, where M = max |le;|. 
1<i<n 


Therefore (5) holds for c, = M./n. We now find a c) > 0 for which 


|xllx < C2 - ||x|| for all x € X. (6) 


Define the real-valued function f: R" > R by 


f(xq,..., Xn) = 


n 
S Xjej|| . 
i=] 


This function is continuous since it is Lipschitz with Lipschitz constant c, if R" is considered 
as a metric space with the Euclidean metric. Since {e;, ..., en} is linearly independent, f 


takes positive values on the boundary of the unit ball, § = {x € R”| > x? = 1}, which is 


1=1 
compact since it is both closed and bounded. A continuous real-valued function on a compact 
topological space takes a minimum value. Let m > 0 be the minimum value of f on S. By 
the homogeneity of the norm || - ||, we conclude that 


|x|] => m - ||x|], for all x € X. 
Thus (6) holds for cy = 1/m. O 
Corollary 5 Any two normed linear spaces of the same finite dimension are isomorphic. 


Proof Since being isomorphic is an equivalence relation among normed linear spaces, it 
suffices to show that if X is a normed linear space of dimension n, then it is isomorphic to the 
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Euclidean space R". Let {e1,..., @n} be a basis for X. Define the linear mapping T: R" > X 
by setting, for x = (x1,...,%,) ER", 


n 
T (x) = S Xj ej. 
i=1 
Since {e;,..., €,} is a basis, T is one-to-one and onto. Clearly T is linear. It remains to show 


that T and its inverse are continuous. Since a linear operator is continuous if and only if it 
is bounded, this amounts to showing that there are constants c; and cz such that for each 
xER’, 

IT(x)I] <1: Ulalle and |I7(x)I| = co [lalls, 


where || - ||, denotes the Euclidean norm on R". The existence of these two constants follows 
from the observation that x + ||T(x)|| defines a norm on R", which, since all norms on R” 
are equivalent, is equivalent to the Euclidean norm. L 


Corollary 6 Any finite dimensional normed linear space is complete and therefore any finite 
dimensional subspace of a normed linear space is closed. 


Proof A finite dimensional space of dimension n is isomorphic to the Euclidean space R", 
which is complete since R is complete. Since completeness is preserved under isomorphisms, 
every finite dimensional normed linear space is complete. For a finite dimensional subspace 
Y of a normed linear space X, since Y, with the metric induced by the inherited norm, is 
complete, Y is a closed subset of the metric space X, where X is considered as a metric space 
with the metric induced by the norm. L 


Corollary 7 The closed unit ball in a finite dimensional normed linear space is compact. 


Proof Let X be a normed linear space of dimension n and B be its closed unit ball. Let 
T: X — R" be an isomorphism. Then the set T(B) is bounded since the operator T is 
bounded and 7(B) is closed since T~! is continuous. Therefore, T(B), being a closed 
bounded subset of R", is compact. Since compactness is preserved by continuous mappings 
and T~! is continuous, B is compact. 


Riesz’s Theorem The closed unit ball of a normed linear space X is compact if and only if X 
is finite dimensional. 


The heart of the proof of this theorem lies in the following lemma. 


Riesz’s Lemma Let Y be a closed proper linear subspace of a normed linear space X. Then 
for each € > 0, there is a unit vector xy) € X for which 


xo — yll > 1—€ forallyeY. 


Proof We consider the case « = 1/2 and leave the general case as an exercise. Since Y is a 
proper subset of X, we may choose x € X~Y. Since Y is a closed subset of X, its complement 
in X is open and therefore there is a ball centered at x that is disjoint from Y, that is, 


inf {Jx-—y] | ye Y}=d>0. (7) 
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Choose a vector y,; € Y for which 


|x — yy || < 2d. (8) 
Define 
x — Yi 
xo = ————_.. 
lx — y4]| 


Then xp is a unit vector. Moreover, observe that for any ye XY, 


x- yj 1 1 
0 - YS xe vi — I = yall y} = ———[x- y}, 
lx — yall lx — yall lx — yil| 


where y’ = y; + ||x — yi|| y belongs to Y. Therefore, by (7) and (8), 


| [| > aT n> 1 to allye Y 
— 7 _ _— T . 
*0~ Yi salto — VI = 5 y 0 


Proof of Riesz’s Theorem We have already shown that the closed unit ball B in a finite 
dimensional normed linear space is compact. It remains to show that B fails to be compact 
if X is infinite dimensional. Assume dim X = oo. We will inductively choose a sequence {x,} 
in B such that ||x, — xm|| > 1/2 for n#m. This sequence has no Cauchy subsequence and 
therefore no convergent subsequence. Thus B is not sequentially compact, and therefore, 
since B is a metric space, not compact. 

It remains to choose this sequence. Choose any vector x, € B. For a natural number 
n, Suppose we have chosen n vectors in B, {x;,...,x,}, each pair of which are more than a 
distance 1/2 apart. Let X,, be the linear space spanned by these n vectors. Then X,, is a finite 
dimensional subspace of X and so it is closed. Moreover, X, is a proper subspace of X since 
dim X = oo. By the preceding lemma we may choose x,,; in B such that |x; — x,41|| > 1/2 
for 1 <i <n. Thus we have inductively chosen a sequence in B any two terms of which are 
more than a distance 1/2 apart. LO 


PROBLEMS 


23. Show that a subset of a finite dimensional normed linear space X is compact if and only if it 
is Closed and bounded. 


24. Complete the proof of Riesz’s Lemma for € # 1/2. 


25. Exhibit an open cover of the closed unit ball of X = ¢? that has no finite subcover. Then do 
the same for X = C[0, 1] and X = L(0, 1]. 


26. For normed linear spaces X and Y, let T: X — Y be linear. If X is finite dimensional, show 
that T is continuous. If Y is finite dimensional, show that T is continuous if and only if ker T 
is closed. 


27. (Another proof of Riesz’s Theorem) Let X be an infinite dimensional normed linear space, B 
the closed unit ball in X, and Bo the unit open ball in X. Suppose B is compact. Then the open 
cover {x + (1/3) Bo}xeg of B has a finite subcover {x; + (1/3) Bo}i<j<n. Use Riesz’s Lemma 
with Y = span[{x1, ..., x,}] to derive a contradiction. 


28. Let X be a normed linear space. Show that X is separable if and only if there is a compact 
subset K of X for which span[K] = X. 
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13.4 THE OPEN MAPPING AND CLOSED GRAPH THEOREMS 


In this section, we use the Baire Category Theorem to establish two essential tools, the Open 
Mapping Theorem and the Closed Graph Theorem, for the analysis of linear operators 
between infinite dimensional Banach spaces. The Baire Category Theorem is used to prove 
the following theorem. 


Theorem 8 Let X and Y be Banach spaces and the linear operator T: X — Y be continuous. 
Then T(X) is a closed subspace of Y if and only if there is a constant M > 0 for which given 
y €T(X), there is an x € X such that 


T(x) =y, xl < Mllyll. (9) 


Proof First suppose there is a constant M > 0 for which (9) holds. Let {y,} be a sequence 
in T(X) that converges to y, € Y. We must show y, belongs to T(X). By selecting a 
subsequence if necessary, we may assume 


lyn — Yn—11| < 1/2” for all n > 2. 
By the choice of M, for each natural number n > 2, there is a vector u, € X for which | 
T (un) = Yn — Yn and |lup|| < M/2". 


Therefore, for n > 2, if we define x, = Dy i=2 u j, then 


T(%n)=yn—- MN (10) 
and © | 
00 . 
IXn+k — Xnl| < M- > 1/2/ for all k > 1. (11) 
jan 


But X is a Banach space and therefore the Cauchy sequence {x,} converges to a vector x, € X. 
Take the limit as n — oo in (10) and use the continuity of T to infer that y, = T(x.) — yt. 
Since y; belongs to T(X) so does y,. Thus T(X) is closed. 

To prove the converse, assume 7(X) is a closed subspace of Y. For notational 


convenience, assume Y = T(X). Let By and By denote the open unit balls in X and Y, 
respectively. Since T(X) = Y, 


oO oO 
Y=(Jn-T(Bx) =(Jn-T(Bx). 
| n=1 n=1 . 

The Banach space Y has nonempty interior and therefore we infer from the Baire Category 
Theorem that there is a natural number n such that the closed set n -T( By ) contains an open 
ball, which we write as yo + [r; - By]. Thus 


r, By CnT( By) — yo CG 2nT( By). 


Hence, if we set r = 2n/r), since T( Bx) is closed, we obtain By Cr-T(Bx). Therefore, 
since By is the closed unit ball in Y, for each y € Y and € > 0, there is an x € X for which | 


lly — T(x)|| < and |[x|| <r- |lyl. (12) 
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We claim that (9) holds for M = 2r. Indeed, let y, belong to Y, y, #0. According to 
(12) with « = 1/2 - || y,|| and y = y,, there is a vector u, € X for which 


lly — T(ur) || < 1/2 - [Lyi] and |luq || <r: [Lyell- 


Now use (12) again, this time with « = 1/ 2? . lly,l| and y= y, —T (uw, ). There is a vector u2 
in X for which 


lye — T(u1) — T(uz)I]| < 1/2? - [Lyell and luz] < 7/2 > ILyell. 


We continue this selection process and inductively choose a sequence {u;,} in X such that for 
each k, 


lly — T(u1) — T(u2) —...— T (ux) <1/2* + Wye] and flugll < 7/24 - Uy. 
For each natural number n, define x, = Y7_; ux. Then, by the linearity of T, for each n, 


lly — T(xn)|| < 1/2” - Ily«ll, 


oO 
Xn+k — Xnll <r: ysl - S 1/2/ and ||xn\| < 2-r- llysll- 
jan 
By assumption, X is complete. Therefore the Cauchy sequence {x,} converges to a vector x, 
in X. Since T is continuous and the norm is continuous, 


T(x) = yx and |[x4l] < 2-7 Ilys. 
Thus (9) holds for M = 2 -r. The proof is complete. L 


A mapping f: X — Y from the topological space X to the topological space Y is said 
to be open provided the image of each open set in X is open in the topological space f(X), 
where f(X) has the subspace topology inherited from Y. Therefore a continuous one-to-one 
mapping f of X into Y is open if and only if f is a topological homeomorphism between X 
and f(X). 


The Open Mapping Theorem Let X and Y be Banach spaces and the linear operator 
T: X — Y be continuous. Then its image T(X) is a closed subspace of Y if and only if the 
operator T is open. 


Proof The preceding theorem tells us that it suffices to show that T is open if and only if 
there is a constant M > 0 for which (9) holds. Let By and By denote the open unit balls 
in X and Y, respectively. We infer from the homogeneity of T and of the norms that (9) is 
equivalent to the inclusion 

By NT(X) CM -T(By). 


By homogeneity, this inclusion is equivalent to the existence of a constant M’ for which 
By NT(X) CM’ - T( Bx). Therefore, we must show that T is open if and only if there is an 
r > 0 for which 


[r- By] NT(X) CT( By). (13) 
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First assume the operator T is open. Then T( By) T(X) is an open subset of T(X) which 
contains 0. Thus there is an r > 0 for which r - By N T(X) C T(By) NT(X) C T( Bx). 
Therefore, (13) holds for this choice of r. To prove the converse, assume (13) holds. Let O 
be an open subset of X and x belong to O. We must show that 7(xp) is an interior point of 
T(O). Since xo is an interior point of O, there is an R > 0 for which x) + R- By CO. We infer 
from (13) that the the open ball of radius r - R about T(x9) in T(X) is contained in T(O). 
Thus T(x) is an interior point of T(). LI 


Corollary 9 Let X and Y be Banach spaces and T € £L(X, Y) be one-to-one and onto. Then 
T~' is continuous. 


Proof The operator T~! is continuous if and only if the operator T is open. L 
Corollary 10 Let || - || and || - ||2 be norms ona linear space X for which both (X, || - ||, ) and 
(X, || - ll2) are Banach spaces. Suppose there is ac > 0 for which 

Il-ll2<c-l]- lon xX. 


Then these two norms are equivalent. 


Proof Define the identity map Id: X > X by Id(x) = x for all x € X. By assumption, 
Id: (X, Il - ll1) > (X, Il Iz) 


is a bounded, and therefore continuous, operator between Banach spaces and, of course, it 
is both one-to-one and onto. By the Open Mapping Theorem, the inverse of the identity, 
Id: (X, || - lz) ~ (X, Il - ll1) also is continuous, that is, it is bounded: there is an M > 0 
for which 

l-lh <M-||-ll2 on x. 


Therefore the two norms are equivalent. LJ 


Definition A linear operator T: X — Y between normed linear spaces X and Y is said to be 
closed provided whenever {x,} is a sequence in X 


if {Xn} —> x9 and {T(x )} > yo, then T(x) = yo. 
The graph of a mapping of T: X — Y¥ is the set {(x, T(x)) € XX Y|x € X}. Therefore 
an operator is closed if and only if its graph is a closed subspace of the product space X x Y. 


The Closed Graph Theorem Let T: X — Y bea linear operator between the Banach spaces 
X and Y. Then T is continuous if and only if it is closed. 


Proof It is clear that T is closed if it is continuous. To prove the converse, assume T is closed. 
Introduce a new norm || - ||, on X by 


lle = [xl] + |7(x)]| for all x € X. 


The closedness of the operator T is equivalent to the completeness of the normed linear 
Space (X, || - ||«). On the other hand, we clearly have 


I <il- ll. on X. 
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Since both (X, ||- |.) and (X, ||-|| ) are Banach spaces it follows from the preceding corollary 
that there is ac > 0 for which 
Il - lle <c-||- on X. 


Thus for all x eX, 
T(x) S le + IT (4) < ella. 


Therefore T is bounded and hence is continuous. L] 


Remark Let X and Y be Banach spaces and the operator T: X — Y be linear. To establish 
continuity of T it is necessary to show that {T(xn)} — T(xo) in Y if {x} ~ xo in X. The 
Closed Mapping Theorem provides a drastic simplification in this criterion. It tells us that to 
establish continuity of T it suffices to check that {T (xn )} — T(x0) in ¥ for sequences {x,} such 
that {x,} —> x in X and {T(x, )} is Cauchy in Y. The usefulness of this simplification will be 
evident in the proof the forthcoming Theorem 11. 


Let V be a linear subspace of a linear space X. An argument using Zorn’s Lemma 

(see Problem 35) shows that there is a subspace W of X for which there is the direct sum 
decomposition 

X=VOW. (14) 


We call W a linear complement of V in X. If a subspace of X has a finite dimensional 
linear complement in X, then it is said to have finite codimension in X. For x € X and the 
decomposition (14), let x = u + v, for v € V and w € W. Define P(x) = v. We leave it as an 
algebraic exercise to show that P: X — X is linear, 


P* = Pon X, P(X) = Vand (Id—P)(X) = W. (15) 


We call P the projection of X onto V along W. We leave it as a second algebraic exercise to 
show that if P: X — X is any linear operator for which P? = P, then 


X = P(X) ® (Id—P)(X). (16) 


We therefore call a linear operator P: X — X for which P* = P a projection. If P is a 
projection, then (Id—P)* = Id —P and therefore Id — P also is a projection. 

Now assume the linear space X is normed. A closed subspace W of X for which 
(14) holds is called a closed linear complement of V in X. In general, it is very difficult to 
determine if a subspace has a closed linear complement. Corollary 8 of the next chapter 
tells us that every finite dimensional subspace of a normed linear space has a closed linear 
complement. Theorem 3 of Chapter 16 tells us that every closed subspace of a Hilbert space 
has a closed linear complement. For now we have the following criterion, in terms of the 
continuity of projections, for the existence of closed linear complements. 


Theorem 11 Let V be a closed subspace of a Banach space X. Then V has a closed linear 
complement in X if and only if there is a continuous projection of X onto V. 


Proof First assume there is a continuous projection P of X onto V. There is the direct 
sum decomposition X = V ® (Id—P)(X). We claim that (Id—P)(X) is closed. This is a 
consequence of the continuity of the projection Id — P. To prove the converse, assume there 
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is a closed subspace W of X for which there is the direct sum decomposition (14). Define 
P to be the projection of X onto V along W. We claim that P is continuous. The Closed 
Graph Theorem tells us that to verify this claim it is sufficient to show that the operator P 
is Closed. Let {x,} be a sequence in X for which {x,} — x9 and {P(x )} > yo. Since {P(xp )} 
is a Sequence in the closed set V that converges to yo, the vector yo belongs to V. Since 
(Id ~P)(x, )} is a sequence in the closed set W that converges to xp — yo, the vector x9 — yo 
belongs to W. Therefore P( yo) = yo and P(xo — yo) = 0. Hence yo = P(xq). Thus the 
operator P 1s closed. L 


In view of Theorem 8 and its corollary, the Open Mapping Theorem, it is interesting 
to provide criteria to determine when the image of a continuous linear operator is closed. 
The following theorem provides one such criterion. 


Theorem 12 Let X and Y be Banach spaces and the linear operator T: X — Y be continuous. 
If T(X) has a closed linear complement in Y, then T( X) is closed in Y. In particular, if T(X) 
has finite codimension in Y, then T( X ) is closed in Y. 
Proof Let Yo be a closed subspace of Y for which 
T(X)®Y=Y. (17) 

Since Y is a Banach space, so is Yo. Consider the Banach space X X Yo, where the linear 
structure on the Cartesian product is defined componentwise and the norm is defined by 

I(x, yl = lal + yl] for all (x, y) e XX Yo. 
Then X X Yo is a Banach space. Define the linear operator S: X X Yo > Y by 

S(x, y) =T(x)+4+ y for all (x, y)e XXYV. 


Then S is a continuous linear mapping of the Banach space X X Yo onto the Banach space 
Y. It follows from Theorem 8 that there is an M > 0 such that for each y € Y there is an 
(x’, y') € X X Yo for which 


T(x’) + y' = y and |[x'|| + lly’ll < M- [lyll. 
Thus, since T(X) M Yo = {0}, for each y € T(X), there is an x € X for which 
T(x) = yand ||x|| < M- [ly]. 


Once more we use Theorem 8 to conclude that T(X) is a closed subspace of Y. Finally, 
since every finite dimensional subspace of a normed linear space is closed, if T(X ) has finite 
codimension, it is closed. CL] 


Remark All linear operators on a finite dimensional normed linear space are continuous, 
open, and have closed images. The results in the section are only significant for linear operators 
defined on infinite dimensional Banach spaces, in which case continuity of the operator does 
not imply that the image is closed. We leave it as an exercise to verify that the operator 
T: £2 — &? defined by 

T({%n}) = (2n/n} for {xn} € 


is continuous but does not have closed image and is not open. LJ 
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29. 


30. 
31. 


32. 


33. 


35. 


36. 


37. 
38. 


PROBLEMS 


Let X be a finite dimensional normed linear space and Y a normed linear space. Show that 
every linear operator T: X — Y is continuous and open. 


Let X be a Banach space and P € £(X, X) be a projection. Show that P is open. 


Let T: X — Y be a continuous linear operator between the Banach spaces X and Y. Show 
that T is open if and only if the image under T of the open unit ball in X is dense in a 
neighborhood of the origin in Y. 


Let {u,} be a sequence in a Banach space X. Suppose that 7° , ||ux|| < oo. Show that there is 
an x € X for which 


n 
lim S Up =X. 
NO V1 


Let X be a linear subspace of C[0, 1] that is closed as a subset of L?[0, 1]. Verify the following 
assertions to show that X has finite dimension. The sequence { f,,} belongs to X. 


(i) Show that X is a closed subspace of C[0, 1]. 


(ii) Show that there is a constant M > 0 such that for all f € X we have || fl2 < || fll. and 
Il flloo <M - Il fille. 


(iii) Show that for each y € [0, 1], there is a function ky in L* such that for each f € X we 
1 
have f(y) = fp ky(x) f(x) dx. 
(iv) Show that if {f,} —> f weakly in L’, then { f,} — f pointwise on [0, 1]. 


(v) Show {f,} — f weakly in L’, then { f,} is bounded (in what sense?), and hence { f,} — f 
strongly in L? by the Lebesgue Dominated Convergence Theorem. 


(vi) Conclude that X, when normed by || - ||2, has a compact closed unit ball and therefore, 
by Riesz’s Theorem, is finite dimensional. 


. Let T be a linear operator from a normed linear space X to a finite-dimensional normed 


linear space Y. Show that T is continuous if and only if ker T is a closed subspace of X. 


Suppose X be a Banach space, the operator T € £L(X, X) be open and Xp be a closed 
subspace of X. The restriction 7) of T to Xo is continuous. Is Ty necessarily open? 


Let V be a linear subspace of a linear space X. Argue as follows to show that V has a linear 
complement in X. 


(i) Ifdim X < oo, let {e;}?_, be a basis for V. Extend this basis for V to a basis fein for X. 
Then define W = span|{en41,..., en+x}]. 


(1) If dim X = ov, apply Zorn’s Lemma to the collection F of all subspaces Z of X for which 
VM Z = {0}, ordered by set inclusion. 


Verify (15) and (16). 


Let Y be a normed linear space. Show that Y is a Banach space if and only if there is a Banach 
space X and a continuous, linear, open mapping of X onto Y. 


13.5 THE UNIFORM BOUNDEDNESS PRINCIPLE 


As a consequence of the Baire Category Theorem we proved that if a family of continuous 
functions on a complete metric space is pointwise bounded, then there is an open set on 
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which the family is uniformly bounded. This has the following fruitful consequences for 
families of linear operators. 


The Uniform Boundedness Principle For X a Banach space and Y a normed linear space, 
consider a family F € £L(X, Y). Suppose the family F is pointwise bounded in the sense that 
for each x in X there is a constant M, > 0 for which 


T(x) || < M, for all T € F. 


Then the family F is uniformly bounded in the sense that there is a constant M > 0 for which 
||| < M for all T in F. 


Proof For each T é€ Ff, the real-valued function f7: X — R defined by f(x) = ||Tx|| is a 
real-valued continuous function on X. Since this family of continuous functions is pointwise 
bounded on X and the metric space X is complete, by Theorem 6 of Chapter 10, there is an 
open ball B( xo, r) in X and a constant C > 0 for which 


| 7(x)|] < C for all x € B(xo, r) and T € F. 
Thus, for each T € F, 
IT (x) = T({x + x0] — x0) I] < T(x + x0 )I + IT (20)|| < C + May for all x € B(O, r). 


Therefore, setting M = (1/r)(C + M,, ), we have ||T|| < M for all T in F. L 


The Banach-Saks—Steinhaus Theorem Let X be a Banach space, Y a normed linear space, 
and {T,,: X — Y} a sequence of continuous linear operators. Suppose that for each x € X, 


lim T,(x) exists in Y. (18) 


Then the sequence of operators {T,,: X — Y} is uniformly bounded. Furthermore, the operator 
T: X — Y defined by 
T(x) = lim T,,(x) for all for all x € X 
n-> OO 


is linear, continuous, and 
||| < lim inf ||T,, |. 


Proof The pointwise limit of a sequence of linear operators is linear. Thus T is linear. 
We infer from the Uniform Boundedness Principle that the sequence {T,,} is uniformly 
bounded. Therefore lim inf ||7,,|| is finite. Let x belong to X. By the continuity of the norm 
on Y, limy+oo ||Tn(x) || > |]T (x). Since, for all n, ||7,(x)l] < [Tull - |lxl], we also have 
|7(x)|| < lim inf ||7, || - ||x|]. Therefore T is bounded and ||T|| < liminf ||T,||. A bounded 
linear operator is continuous. LJ 


In the case that Y is a Banach space, (18) is equivalent to the assertion that for each 
x € X, {T,(x)} is a Cauchy sequence in Y. 
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39. 


41. 


42. 


PROBLEMS 


As a consequence of the Baire Category Theorem we showed that a mapping that is the 
pointwise limit of a sequence of continuous mappings on a complete metric space must be 
continuous at some point. Use this to prove that the pointwise limit of a sequence of linear 
operators on a Banach space has a limit that is continuous at some point and hence, by 
linearity, is continuous. 


. Let {f,} be a sequence in L}{a, b]. Suppose that for each g € La, 5], littn+soo f 8 dn 


exists. Show that there is a function in f € L'[a, b] such that limy_,0. f 78 in =JSe8- f for 
all ge L™[a, bj. 


Let X be the linear space of all polynomials defined on R. For p € X, define || p|| to be the 
sum of the absolute values of the coefficients of p. Show that this is a norm on X. For each n, 
define ,: X > R by y,(p) = p\")(0). Use the properties of the sequence {i),} in L(X, R) 
to show that X is not a Banach space. 


(i) Use Zorn’s Lemma to show that every linear space has a Hamel basis. 
(ii) Show that any Hamel basis for an infinite dimensional Banach space must be uncountable. 


(iii) Let X be the linear space of all polynomials defined on R. Show that there is not a norm 
on X with respect to which X is a Banach space. 
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For a normed linear space X, we denoted the normed linear space of continuous linear 
real-valued functions of X by X* and called it the dual space of X. In this and the following 
chapter, we explore properties of the mapping from X x X* to R defined by 


(x, w)+> (x) for all x € X, p € X* 


to uncover analytic, geometric, and topological properties of Banach spaces. The departure 
point for this exploration is the Hahn-Banach Theorem. This is a theorem regarding the 
extension of certain linear functionals on subspaces of an unnormed linear space to linear 
functionals on the whole space. The elementary nature of this theorem provides it with such 
flexibility that in this chapter we deduce from it the following three properties of linear 
functionals: (i) for a normed linear space X, any bounded linear functional on a subspace of 
X may be extended to a bounded linear functional on all of X, without increasing its norm, 
(ii) for a locally convex topological vector space X, any two disjoint closed convex sets of 
X may be separated by a closed hyperplane; and (iii) for a reflexive Banach space X, any 
bounded sequence in X has a weakly convergent subsequence. 


14.1 LINEAR FUNCTIONALS, BOUNDED LINEAR FUNCTIONALS, 
AND WEAK TOPOLOGIES 


Let X be a linear space. We denote by X® the linear space of linear real-valued functions on 
X. For # € X", # #0, and xp € X for which (xp) #0, we claim that X may be expressed as 
the direct sum 

X = [ker W] ® span [xo], (1) 
where the kernel of w, kerw, is the subspace {x € X|w(x) = 0}. Indeed, clearly 
[ker #] MN span [xo] = {0}. On the other hand, we may write each x € X as 


-|,- YO) [4 Ho ., ,- YO), = 
‘=| Wh(x0) +2 sandv( (x0) ) " 


272 Chapter 14 Duality for Normed Linear Spaces 


Observe that for a real number c, if xo belongs to X and (x9) = c, then 


Wl(c)={xeX| (x) =c} =kerb + x9. 


Therefore, by (1), if X is finite dimensional of dimension n and yf is nonzero, then for each 
c ER, the level set y~!(c) is the translate of an (n — 1) dimensional subspace of X. 

If a linear subspace Xo of X has the property that there is some xp € X, x9 #0 for which 
X = Xo ® span [xo], then Xo is said to be of codimension 1 in X. A translate of a subspace 
of codimension 1 is called a hyperplane. 


Proposition 1 A linear subspace Xo of a linear space X is of codimension 1 if and only if 
Xo = ker, for some nonzero  € X*. 


Proof We already observed that the kernel of a nonzero linear functional is of codimension 1. 
Conversely, suppose Xo is a subspace of codimension 1. Then there is a vector x9 #0 for 
which X = Xo ® span[xo]. For A € R and x € Xo, define W(x + Axo) = A. Then # #0, w is 
linear and ker = Xo. LJ 


The following proposition tells us that the linear functionals on a linear space are 
plentiful. 


Proposition 2 Let Y be a linear subspace of a linear space X. Then each linear functional on 
Y has an extension to a linear functional on all of X. In particular, for each x € X, there is a 
we X® for which w(x) 40. 


Proof As we observed in the preceding chapter (see Problem 36 of that chapter), Y has a 
linear complement in X, that is, there is a linear subspace Xo of X for which there is the 
direct sum decomposition 

X=YO@Xpo. 


Let 7 belong to Y#. For x € X, we have x = y +x, where y € Y and xo € Xo. Define 
n(x) = n(y). This defines an extension of 7 to a linear functional on all of X. 


Now let x belong to X. Define n: span [x] — R by n(Ax) = A- ||x||. By the first part of 
the proof, the linear functional 7 has an extension to a linear functional on all of X. L 


We are particularly interested in linear spaces X that are normed and subspaces of X! 
that are contained in the dual space of X, X*, that is, linear spaces of linear functionals that 
are continuous with respect to the topology induced by the norm. If X is a finite dimensional 
normed linear space, then every linear functional on X belongs to X* (see Problem 3). This 
property characterizes finite dimensional normed linear spaces. 

A subset B of a linear space X is called a Hamel basis for X provided each vector in X is 
expressible as a unique finite linear combination of vectors in B, We leave it as an exercise to 
infer from Zorn’s Lemma that every linear space possesses a Hamel basis (see Problem 16). 


Proposition 3. Let X be a normed linear space. Then X is finite dimensional if and only if 
x! = x. 


Section 14.1 Linear Functionals, Bounded Linear Functionals, and Weak Topologies 273 


Proof We leave it as an exercise to show that since all norms on a finite dimensional linear 
space are equivalent, all linear functionals on such spaces are bounded. Assume X is infinite 
dimensional. Let B be a Hamel basis for X. Without loss of generality we assume the vectors 
in B are unit vectors. Since X is infinite dimensional, we may choose a countably infinite 
subset of B, which we enumerate as {x;,}7° ,. For each natural number k and vector x € X, 
define 1; (x) to be the coefficient of x; with respect to the expansion of x in the Hamel basis 
B. Then each yy, belongs to X# and therefore the functional ys: X > R defined by 


w(x) = S k «Wy (x) for all x € X 
k=1 


also belongs to X#. This linear functional is not bounded since each x; is a unit vector for 
which (x) =k. L 


The following algebraic property of linear functionals is useful in establishing properties 
of weak topologies. 


Proposition 4 Let X be a linear space, the functional belong to X*, and {yp;}"_, be contained 
in X*. Then is a linear combination of {yp;}"_, if and only if 


ker #; C kerw. (2) 
i=1 


I 


Proof It is clear that if is a linear combination of {w;}?_,, then the inclusion (2) holds. 
We argue inductively to prove the converse. For n = 1, suppose (2) holds. We assume 
ww #0, for otherwise there is nothing to prove. Choose x9 #0 for which (x9) = 1. Then 
W1(xo) #0 also since kery, C kerw. However, X = ker, ® span[xo]. Therefore, if we 
define A; = 1/i;(x0) we see, by direct substitution, that y = A ,y,. Now assume that for 
linear functionals on any linear space, if (2) holds forn = k—1, then wis a linear combination 
of Wy, ..., #1. Suppose (2) holds for n = k. If #;, = 0, there is nothing to prove. So choose 
xo € X with (x0) = 1. Then X = Y ® span [xo], where Y = ker yy, and therefore 


k-1 

(\[kerp; NY] Ckerwny. 

i=l 
By the induction assumption, there are real numbers Aj, ..., Ay_; for which 
k-1 
y= S) Ai yy; on Y. 
i=1 
A direct substitution shows that if we define A, = (x9) - a dA; - Wi(xo), then 


k 
= A; - jon X. 
yee ; 
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Recall that for two topologies T; and T2 on a set X, we say that TJ; is weaker 
than T>, or T> is stronger than 71, provided T; C T>. Observe that a function on X 
that is continuous with respect to a topology on X, then it is also continuous with respect 
to any stronger topology on X but may not be continuous with respect to a weaker 
topology. If F is any collection of real-valued functions on a set X, the weak topology 
on X induced by F, or the F-weak topology on X, is defined to be the weakest topology 
on X (that is, the topology with the fewest number of sets) for which each function in 
F is continuous. A base at x € X for the F-weak topology on X comprises sets of the 
form 


Ne, fy on fy (%) = {x EX | [fe(x") — fe(x)| <€ for 1 <k <n}, (3) 


where € > Q and { f;}7_, is a finite subcollection of ¥. For a topology on a set, we know what 
it means for a sequence in the set to converge, with respect to the topology, to a point in the 
set. It is easy to see that a sequence {x,} in X converges to x € X with respect to the F-weak 
topology if and only if 


Tim f(%n) = f(x) for all f € F. (4) 


A function on X that is continuous with respect to the F-weak topology is called F-weakly 
continuous. Similarly, we have F-weakly open sets, F-weakly closed sets, and F-weakly 
compact sets. 

For a linear space X, it is natural and very useful to consider weak topologies induced 
on X by linear subspaces W of X?. 


Proposition 5 Let X be a linear space and W a subspace of X*. Then a linear functional 
p: X — Ris W-weakly continuous if and only if it belongs to W. 


Proof By the definition of the W-weak topology, each linear functional in W is W-weakly 
continuous. It remains to prove the converse. Suppose the linear functional y: X > R is 
W-weakly continuous. By the continuity of w at 0, there is a neighborhood WV of 0 for which 
lWo(x)| = I(x) —p(0)| <1 if x € N. There is a neighborhood in the base for the W-topology 
at 0 that is contained in VV. Choose € > 0 and 1, ..., #, in W for which Ny, y, CN. 
Thus 


aang 


le(x)| <1 if [by (x)| < for alll <k <n. 


By the linearity of w and the y;’s, we have the inclusion N;_, ker x C ker. According to 
Proposition 4, y is a linear combination of ys, ..., %,. Therefore, since W is a linear space, 
W belongs to W. LI 


The above proposition establishes a one-to-one correspondence between linear sub- 
spaces of X# and weak topologies on X induced by such subspaces. 


Definition Let X be a normed linear space. The weak topology induced on X by the dual 
space X* is called the weak topology on X. 


A base at x € X for the weak topology on X comprises sets of the form 


Nether (4) = {x EX | le (x! — x)| < efor 1 <k <n}, (5) 
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where € > 0 and {y}7_, 18 a finite subcollection of X*. For topological concepts with 
respect to the weak topology, we use the adjective “weakly”: so we have weakly compact 
sets, weakly open sets, etc. Thus a sequence {x,} in X converges weakly to x € X if and 
only if 
lim w(xp) = W(x) for all p € X*. (6) 
n> O&O 
It is convenient to write {x,}— x in X to mean that {x,} is a sequence in X that converges 
weakly to the point x € X. 
For X a normed linear space and W a subspace of X*, there is the following inclusion 
among topologies on X: 


W-weak topology on X C weak topology on X C strong topology on X. 


We infer from Proposition 5 that the W-weak topology coincides with the weak topology if 
and only if W = X*. Furthermore, the weak topology coincides with the strong topology if 
and only if X is finite dimensional (see Problem 6). Frequently, for a normed linear space, we 
call the topology induced by the norm the strong topology on X. If no adjective is attached 
to a topological concept associated with a normed linear space, it is implicitly assumed that 
the reference topology is the strong topology. 

For normed linear spaces that are dual spaces, there is a third important topology on 
the space besides the weak and the strong topologies. Indeed, for a normed linear space X 
and x € X we define the functional J(x): X* > R by 


J(x)[b] = w(x) for all & € X*. 


It is clear that the evaluation functional J(x) is linear and is bounded on X* with ||J(x)|| < 
||x||. Moreover, the operator J: X — (X*)* is linear and therefore J(X) is a linear subspace 
of (X*)*. 


Definition Let X be a normed linear space. The weak topology on X* induced by J(X) C 
(X*)* is called the weak-« topology on X*. 


A base at # € X* for the weak-« topology on X* comprises sets of the form 


Ne, xy, onan (UW) = {Wi € X* | I(! — b)(xx)| < efor 1 < k <n}, (7) 


where ¢€ > 0 and {x;,}7_, is a finite subset of X. A subset of X** that is open with respect 
to the weak-+ topology is said to be weak-* open, similarly for other topological concepts. 
Thus a sequence {,,} in X* is weak-* convergent to p € X* if and only if 


im w(x) = w(x) for all x € X. (8) 


Therefore weak-+ convergence in X* is simply pointwise convergence. For a normed linear 
space X, the strong, weak, and weak-* topologies on X* are related by the following 
inclusions: 


weak- * topology on X* C weak topology on X* C strong topology on X*. 
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Definition Let X be a normed linear space. The linear operator J: X — (X*)* defined by 
J(x)[p] = (x) for all x © X,p € X* 


is called the natural embedding of X into (X*)*. The space X is said to be reflexive provided 
J(X) = (X*)*. 


It is customary to denote (X*)* by X** and call X** the bidual of X. 


Proposition 6 A normed linear space X is reflexive if and only if the weak and weak-x 
topologies on X* are the same. 


Proof Clearly if X is reflexive, then the weak and weak-x topologies on X* are the same. 
Conversely, suppose these two topologies are the same. Let ¥: X* —> R be a continuous 
linear functional. By definition of the weak topology, V is continuous with respect to the 
weak topology on X*. Therefore it is continuous with respect to the weak-x topology. We 
infer from Proposition 5 that V belongs to J(X). Therefore J (X) = xX™, O 


At present, we are not justified in calling J: X > X** an “embedding” since we have 
not shown that J is one-to-one. In fact, we have not even shown that on a general normed 
linear space X there are any nonzero bounded linear functionals. We need a variation 
of Proposition 2 for linear functionals that are bounded. The forthcoming Hahn-Banach 
Theorem will provide this variation and, moreover, show that J is an isometry. Of course, we 
have already studied the dual spaces of some particular normed linear spaces. For instance, 
if E is a Lebesgue measurable set of real numbers and 1 < p< ov, the Riesz Representation 
Theorem characterizes the dual of L?(£). 


PROBLEMS 
I. Verify the two direct substitution assertions in the proof of Proposition 4. 


2. Let Xo be a codimension 1 subspace of a normed linear space X. Show that Xo is closed with 
respect to the strong topology if and only if the Xp = ker w for some we X*, 


3. Show that if X is a finite dimensional normed linear space, then every linear functional on X 
is continuous. 


4. Let X be a finite dimensional normed linear space of dimension n. Let {e;,..., e,} be a basis 
for X. For 1 <i <n, define yj € X* by Wi(x) = x; for x = xje; +---+x,e, € X. Show that 
{u,..., W,} is a basis for X*. Thus dim X* =n. 


5. Let X be a finite dimensional linear space. Show that the weak and strong topologies on X 
are the same. 


6. Show that every nonempty weakly open subset of an infinite dimensional normed linear space 
is unbounded with respect to the norm. 


7. Let X be a finite dimensional space. Show that the natural embedding J: X > X** is 
one-to-one. Then use Problem 4 to show that J: X > X** is onto, so X is reflexive. 


8. For a vector v#0 in Euclidean space R", explicitly exhibit a linear functional y: R? > R for 
which w(v) = 1. 
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9. For a sequence {x,} #0 in ¢”, explicitly exhibit a continuous linear functional y: ¢2 > R for 
which w( {x,}) = 1. 


10. For a function f #0 in L?[a, b],1 < p < oo, explicitly exhibit a continuous linear functional 
w: L?[a, b] > R for which ( f) = 1. 


11. Consider C[a, b] with the maximum norm. For a function f #0 in C[a, 5], explicitly exhibit a 
continuous linear functional y: C[a, b] > R for which Wf) = 1. 


12, For 1 < p< ow, let Y be a closed subspace of L?[a, b] of codimension 1. Show that there is 
function g € L%[a, b], where q is the conjugate of p, for which 


| jain = oh 
[a, 5] 


13. Let X be a normed linear space and w belong to X'~X*. Show that ker is dense, with 
respect to the strong topology, in X. 


r= {rete b] 


14. Let X be the normed linear space of polynomials restricted to [a, b] . For p € X, define p( p) 
to be the sum of the coefficients of p. Show that w is linear. Is » continuous if X has the 
topology induced by the maximum norm? 


15. Let X be the normed linear space of sequences of real numbers that have only a finite 


CO 
number of nonzero terms. For x = {x,} € X, define (x) = > x,. Show that y is linear. Is p 
=1 


n 
continuous if X has the topology induced by the © norm? 


16. Let X be a linear space. A subset E of X is said to be linearly independent provided each 
x € E fails to be a finite linear combination of points in E~{x}. Define F to be the collection 
of nonempty subsets of X that are linearly independent. Order F by set inclusion. Apply 
Zorn’s Lemma to conclude that X has a Hamel basis. 


17. Provide an example of a discontinuous linear operator T from a normed linear space X to a 
normed linear space Y for which T has a closed graph. (Hint: Let & be a discontinuous linear 
functional on a normed linear space X and Y = {y € X XR| y = (x, (x))}, the graph of ). 
Does this contradict the Closed Graph Theorem? 


14.2 THE HAHN-BANACH THEOREM 


Definition A functional p: X--> [0, 00) on a linear space X is said to be positively 
homogeneous provided 


p(Ax) = Ap(x) forallx € X,A>0, 
and said to be subadditive provided 
P(x+y) < p(x) + ply) forallx,y € X. 


Any norm on a linear space is both subadditive (the triangle inequality) and positively 
homogeneous. 


The Hahn-Banach Lemma _ Let p be a positively homogeneous, subadditive functional on the 
linear space X and Y a subspace of X on which there is defined a linear functional & for which 


wu < pony. 
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Let z belong to X~Y. Then y can be extended to a linear functional p on span [Y +z] for which 
is < pon span[Y + z]. 


Proof Since every vector in span [Y + z] may be written uniquely as y + Az, for y € Y and 
A&R, itis sufficient to find a number y(z) with the property that 


w(y) +Ab(z) < p(y+Az) for ally € YandAeER. (9) 


Indeed, for such a number f(z), define (y+ Az) = W(y) + AW(z) for all yin YandAeER 
to obtain the required extension. 
For any vectors y;, y2 € Y, since # is linear, yw < pon Y and pis subadditive, 


Wy) + ¥(y2) = #(y1 +92) S$ P(y1 +92) = P((M1 —z) + (92 +2)) S PC —z) + P(¥2 +2), 
and therefore 
W(y1) — P01 — 2) S Wyo) + p(y2 +2). 

As we vary y; and y2 among all vectors in Y, any number on the left-hand side of this 
inequality is no greater than any number on the right. By the completeness of R, if we 
define (z) to be the supremum of the numbers on the left-hand side of this inequality, then 
(z) € R. Furthermore, for any y € Y, #(y) — p(y —z) < W(z), by the choice of #(z) as an 
upper bound and (z) < —p(y) + p(y +z) by the choice of (z) as the least upper bound. 
Therefore 

WY) — P(y—z) SH(z) < -W(y) + p(y +z) for all y € Y. (10) 
Let y belong to Y. For A > 0, in the inequality w(z) < —#(y) + p(y +z), replace y by 
y/A, multiply each side by A, and use the positive homogeneity of both p and y to obtain 
the desired inequality (9). For A < 0, in the inequality y(y) — p(y —z) < #(z), replace y 
by —y/A, multiply each side by —A, and once more use positive homogeneity to obtain the 
desired inequality (9). Therefore (9) holds if the number w(z) is chosen so that (10) holds. 0 


The Hahn-Banach Theorem Let p be a positively homogeneous, subadditive functional on a 
linear space X and Y a subspace of X on which there is defined a linear functional w for which 


ww < pony. 
Then is may be extended to a linear functional on all of X for which s < p on all of X. 


Proof Consider the family F of all linear functionals y defined on a subspace Y, of X for 
which Y CY, n = w on Y, and n < pon Yp. This particular family F of extensions of wy 1s 
partially ordered by defining n; < 7 provided Yn, G Yq) and m1 = 92 on Yy,. 

Let Fo be a totally ordered subfamily of ¥. Define Z to be the union of the domains 
of the functionals in Fo. Since Fo is totally ordered, any two such domains are contained 
in just one of them and therefore, since each domain is a linear subspace of X, so is Z. 
For z € Y, choose 7 € Fo such that z € Y,,: define n*(z) = n(z). Then, again by the total 
ordering of Fo, n* is a properly defined linear functional on Z. Observe that n* < p on 
Z, Y C Z and 7* = on Y, since each functional in Fo has these three properties. Thus 
7 < 7" for all y € Fo. Therefore every totally ordered subfamily of F has an upper bound. 
Hence, by Zorn’s Lemma, F has a maximal member yp. Let the domain of fp be Yo. By 
definition, Y C Yo and Ww < pon Yo. We infer from the Hahn-Banach Lemma that this 
maximal extension wv is defined on all of X. L 
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Theorem 7 Let Xo be a linear subspace of a normed linear space X. Then each bounded 
linear functional ys on Xo has an extension to a bounded linear functional on all of X that has 
the same norm as sp. In particular, for each x € X, there is a € X* for which 


p(x) = lI and |\p|| = 1. : (11) 
Proof Let yw: Xo — R be linear and bounded. Define 


= |jwll = sup {1W(x)| | x © Xo, [xl] <1}. 


Define p: X > R by | 
| | p(x) = M - ||x|| for all x € X. 


The functional p is subadditive and positively homogeneous. By the definition of M, 
wy < pon Xo. 


By the Hahn-Banach Theorem, # may be extended to a continuous linear functional 
on all of X and (x) < p(x) = M||x|| for allx € X. Replacing x by —x we infer that 
le(x)| < p(x) = M||x|| for all x € X and therefore the extension of y to all of X has the 
same norm as: Xo > R. 


Now let x belong to X. Define n: span[x] > R by (Ax) = A- |[x/]. Observe that 
||q|| = 1. By the first part of the proof, the functional 7 has an extension to a bounded linear 
functional on all of X that also has norm 1. | L 


Example Let xo belong to the closed, bounded interval [a, b]. Define 
(Ff) = (20) for all f € Cla, b]. 


We consider C[a, b] as a subspace of L™[a, b] (see Problem 27). We infer from the preceding 
theorem that y has an extension to a bounded linear functional y : L©[a, b] > R. 


Example Define the positively homogeneous, subadditive functional pon £™ by 
P({xn}) = lim sup{x,} for all {x,} € 2°. 
Let cp C £™ be the subspace of convergent sequences. Define L on co by 
L({xn}) = im x, for all {xn} E Co. 


Since L is linear and L < poncg, L has an extension to a linear functional L on £~ for which 
L < pon &™. Any such extension is called a Banach limit. . 


In the preceding chapter we considered whether a closed subspace Xo of a Banach 
space X has a closed linear complement in X. The following corollary tells us it does if Xo is 
finite dimensional. 


Corollary 8 Let X be a normed linear space. If Xq is a finite dimensional subspace of X, then 
there is a closed linear subspace X, of X for which X = Xo ® X. 
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Proof Let e1,...,e, be a basis for Xg. For 1 < k <n, define yy: Xp > R by (> Xj - e;) 


iz 
= A,. Since Xo 1s finite dimensional, the ,’s are continuous. According to Theorem 7, each 
w, has an extension to a continuous functional #, on all of X. Therefore each yy, has a 
closed kernel so that the subspace X; = N/_, ker, also is closed. It is easy to check that 
X=X)@X. O 


Corollary 9 Let X be a normed linear space. Then the natural embedding J: X > X** is an 
isometry. 


Proof Let x belong to X. Observe that by the definition of the norm on the dual space 
lx) | < bl - [all for alll y © X*. 


Thus 
|J(x)()] < [xl] - [pl] for all we X*. 


Therefore J(x) is bounded and ||J(x)|| < ||x||. On the other hand, according to Theorem 7, 
there is a W € X* for which ||| = 1 and J(x)(W) = ||x||. Therefore ||x|| < ||J(x)||. We 
conclude that J is an isometry. LJ 


Theorem 10 Let Xo be a subspace of the normed linear space X. Then a point x in X belongs 
to the closure of Xo if and only if whenever a functional  € X* vanishes on Xo, it also vanishes 
at x. 


Proof It is clear by continuity that if a continuous functional vanishes on Xo it also vanishes 
on the closure of Xo. To prove the converse, let xp belong to X~X0. We must show that 
there is aw € X* that vanishes on X but (xo) #0. Define Z = Xo ® [xo] and wy: Z > R by 


(x + Axo) = A for all x e Xp andA ER. 


We claim that w is bounded. Indeed, since Xo is closed, its complement is open. Thus there 
is an r > 0 for which ||u — xo|| > r for all u € Xo. Thus, for x € Xp andA ER, 


|x + Axoll = [AlI|(-1/A- x) — xoll = [Al -r- 


From this we infer that #: Z > R is bounded with ||W|| < 1/r. Theorem 7 tells us that 
has an extension to a bounded linear functional on all of X. This extension belongs to X”*, 
vanishes on Xo, and yet (x9) £0. a 


We leave the proof of the following corollary as an exercise. 


Corollary 11. Let S be a subset of the normed linear space X. Then the linear span of S is 
dense in X if and only if whenever ib € X* vanishes on S, then & = 0. 


Theorem 12 Let X be a normed linear space. Then every weakly convergent sequence in X is 
bounded. Moreover, if {xn} — x in X, then 


|x|] < lim inf [lI (12) 
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Proof Let {x,} — xin X. Then {J(x,): X* > R}isa sequence of functionals that converges 
pointwise to J(x): X* — R. The Uniform Boundedness Theorem tells us that {J (xn )} is a 
bounded sequence of linear functionals on X*. Since the natural embedding J is an isometry, 
the sequence {x,} is bounded. To verify (12), according to Theorem 7, there is a functional 
w € X* for which ||| = 1 and W(x) = ||x||. Now | 


(Xn )| S [ell - fleall = [xl] for alll n. 
Moreover, |{i(x, )}| converges to |y(x)| = ||x||. Therefore 


xl] = lim |h(xn )| < liming |x. 7 
Remark The Hahn-Banach Theorem has a rather humble nature. The only mathematical 
concepts needed for its statement are linear spaces and linear, subadditive, and positively 
homogeneous functionals. Besides Zorn’s Lemma, its proof relies on nothing more than the 
rudimentary properties of the real numbers. Nevertheless, often by making a clever choice of the 
functional p, this theorem permits us to create basic analytical, geometric, and topological tools 
for functional analysis. We established Theorem 7 by applying the Hahn-Banach Theorem 
with the functional p chosen to be a multiple of the norm. In Section 4 of this chapter, we use 
the Hahn-Banach Theorem with p the so-called gauge functional associated with a convex set 
to separate disjoint convex subsets of a linear space by a hyperplane. In the next chapter, we 
use the natural embedding J of a normed linear space into its bidual to prove that the closed 
unit ball of a Banach space X is weakly sequentially compact if and only if X is reflexive. ! 


PROBLEMS | 


18. Let X be a normed linear space, belong to X*, and {,} be in X*. Show that if {Wi} converges 
weak-+* to w, then | | 


lel] < lim sup [Pal 


19. Let X = R" be normed with the Euclidean norm, Y a subspace of X, and wy: Y > Ra linear 
functional. Define Y+ to be the linear subspace of R" consisting of vectors orthogonal to Y. 
Then R" = Y @Y+. Forx=y+y’, yeY, y € ¥+, define w(x) = wy). Show that this 
properly defines ¢ ¢ (R")*, is an extension of ¢ on Y, and has the same norm as ly. 


20. Let X = L? = L?[a, b],1< p<oo and m be Lebesgue measure. For f #0 in L’, define 
1 

h)=——_; 
If lp 


Use Holder’s Inequality to show that y € (L?)*, |lp|| =1 and &( f) = If Il,- 
21. For each point x in a normed linear space X, show that 


| sen( f)-| fl?! -hdm for all h € L?. 
[a, b] 


lx] = sup {y(x) | pe x*, Ip <1}. 


lAn interesting extension of the Hahn-Banach Theorem due to Agnew and Morse and a variety of applications 
of the Hahn-Banach Theorem may be found in Peter Lax’s Functional Analysis [Lax02]. 
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22. 


23. 


24. 


25. 


26. 


27. 


Let X be a normed linear space and Y a closed subspace of X. Show that for each xp € X~Y, 
there is a € X* such that 


[Iwill =1, » =0 on ¥ and Y(xo) =d, where d = dist(xo, Y) = inf {\]xo — yll | ye Y}. 


Let Y be a linear subspace of a normed linear space X and z be a vector in X. Show that 
dist(z, Y) = sup {#(z)| Iv] =1,4=O0o0n¥}. 


Let X be a vector space. A subset C of X is called a cone provided x+ y € C and Ax EC 
whenever x, y belong to C and A > 0. Define a partial order in X by defining x < y to mean 
y—x €C. A linear functional f on X is said to be positive (with respect to the cone C) 
provided f > 0 on C. Let Y be any subspace of X with the property that for each x € X 
there is a y € Y with x < y. Show that each positive linear functional on Y may be extended 
to a positive linear functional on X. (Hint: Adapt the Hahn-Banach Lemma and use Zorn’s 
Lemma with respect to the relation < to find a maximal extension.) 


Let Xo be a subset of a metric space X. Use the Tietze Extension Theorem to show that every 
continuous real-valued function on Xo has a continuous extension to all of X if and only if Xo 
is closed. Does this contradict Theorem 7? 


Let (X, p) be a metric space that contains the closed set F. Show that a point x € X belongs 
to F if and only if every continuous functional on X that vanishes on F also vanishes at x. 
Can this be used to prove Theorem 10? 


Let [a, b] be a closed, bounded interval of real numbers and consider L™[a, b], now formally 
considered as the collection of equivalence classes for the relation of pointwise equality 
almost everywhere among essentially bounded functions. Let X be the subspace of L™[a, b] 
comprising those equivalence classes that contain a continuous function. Show that such an 
equivalence class contains exactly one continuous function. Thus X is linearly isomorphic 
to C[a, b] and therefore, modulo this identification, we may consider C[a, b] to be a linear 
subspace L™[a, b]. Show that Cla, b] is a closed subspace of the Banach space L™[a, 5]. 


. Define y: Cla, b] > R by &( f) = f(a) for all f € Cla, b]. Use Theorem 7 to extend yp toa 


continuous linear functional on all of L~[a, b] (see the preceding problem). Show that there 
is no functional h € L}[a, b] for which 


b 
wn=| h- f for all f ¢ L™[a, bd]. 


14.3 REFLEXIVE BANACH SPACES AND WEAK SEQUENTIAL CONVERGENCE 


Theorem 13 Let X be a normed linear space. If its dual space X* is separable, then X also is 
separable. 


Proof Since X* is separable, so is its closed unit sphere S* = {y € X* | |||] = 1}. Let {hn}, 
be a countable dense subset of S*. For each index n, choose x, € X for which 


[nl] = 1 and Pn (xn) > 1/2. 


Define Xo to be the closed linear span of the set {x,|1 <n < oo}. Then Xq is separable 
since finite linear combinations, with rational coefficients, of the x,’s is a countable set that is 
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dense in Xo. We claim Xo = X. Indeed, otherwise, by Theorem 10, we may choose * € X* 
for which 
|v*|| = 1 and y" = 0on Xp. 


Since {y,}°°., is dense in S*, there is a natural number no for which ||y* — Wnol| < 1/2. 
Therefore 


(Ying — W°) (Xing IS Ming — WU + Ung ll < 1/2 and yet (ng — W*) (Xing ) = Png (Xing) > 1/2. 


From this contradiction we infer that X is separable. | | LI 
Corollary 14 A reflexive Banach space is separable if and only if its dual is separable. 


Proof Let X be a Banach space. The preceding theorem tells us that if X* is separable so 
is X, irrespective of any reflexivity assumption. Now assume that X is reflexive and separable. 
Thus J(X) = X** = (X*)* is separable since J is an isometry. According to the preceding 
theorem, with X replaced by X*, X* is separable. O 


Proposition 15 A closed subspace of a reflexive Banach space is reflexive. 


Proof Let Xo be a closed subspace of reflexive Banach space X. Define J to be the natural 
embedding of X in its bidual X**. Let Jo be the natural embedding of Xo in its bidual X}*. 
To show that Jo is onto, let 5 belong to X>*. Define S’ € X** by | 


S'(w) = S(wlx, ) for all w € X*. 


Then 5S’: Xj, > R is linear and it is bounded with ||5’|| < ||S|]. By the reflexivity of X, there 
is an xo € X for which S’ = J(xo). But if fy € X* vanishes on Xo, then S’(y) = 0, so that 


(x0) = J(xo )[p] = S’(#) = 0. 
Theorem 10 tells us that xo belongs to Xo. Therefore S = Jo(x0). CJ 


We record again Helley’s Theorem, which we proved in Chapter 8. 


Theorem 16 (Helley’s Theorem) Let X be a separable normed linear space. Then every 
bounded sequence {in} in X* has a subsequence that converges pointwise on X to w € X*, that 
is, {Jn} has a subsequence that converges to with respect to the weak-x topology. 


Theorem 17 Let X be a reflexive Banach space. Then every bounded sequence in X has a 
weakly convergent subsequence. 


Proof Let {x,} be a bounded sequence in X. Define Xo to be the closure of the linear span of 
the set (x, |n € N}. Then Xo is separable since finite linear combinations of the x,’s, with ra- 
tional coefficients, is a countable set that is dense in Xo. Of course XQ is closed, Proposition 15 
tells us that Xo is reflexive. Let Jo be the natural embedding of Xo in its bidual X}*. It fol- 
lows from Proposition 15 that Xj also is separable. Then {Jo(x, )} is a bounded sequence of 
bounded linear functionals on a separable Banach space X%. According to Helley’s Theorem, 
a subsequence {Jo( xn, )} converges weak-x to S € (X})*. Since Xo is reflexive, there is some 
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xy € Xo for which S = Jo(x). Since every functional in X* restricts to a functional in Xj, 
the weak-x convergence of {Jo(xn, )} to Jo(xo) means precisely that {x,,} converges weakly 
to xg. 


Corollary 18 Let X be a reflexive Banach space. Then every continuous real-valued linear 
functional on X takes a maximum value on the closed unit ball B of X. 


Proof Let y belong to X*. The supremum of the functional values of w on B is |||. Choose 
a sequence {x,,} in B for which limy-.o W(X, ) = ||W||. In view of Theorem 17, we may assume 
that {x,} converges weakly to xp. According to (12), xg belongs to B. Since 


lim W(Xn ) = (x0), 


n-> OO 


yw takes a maximum value on B at xo. L 


Theorem 17 makes it interesting to identify which of the classical Banach spaces 
are reflexive. 


Proposition 19 Let [a, b| be a closed, bounded interval of real numbers. Then C{a, b}, 
normed with the maximum norm, is not reflexive. 


Proof Assume [a, b] = [0, 1]. For x € [0, 1], define the evaluation functional y,: C[0, 1] > 
R by ,( f) = f(x). Then , is a bounded linear functional on C[0, 1]. Therefore, if { f,} 
converges weakly to f in C[0, 1], then {f,} — f pointwise on [0, 1]. For a natural number 
n, define f,(x) = x" for x € [0, 1]. Then { f,} converges pointwise to a function f that is not 
continuous. Therefore no subsequence can converge pointwise to a continuous function and 
hence no subsequence can converge weakly to a function in C[0, 1]. We infer from Theorem 
17 that C[0, 1] fails to be reflexive. U 


Proposition 20 For 1 < p< oo and E a Lebesgue measurable set of real numbers, L?( E) 
is reflexive. 


Proof Let g be conjugate to p. Define the Riesz representation mapping R from L?(E) to 
(L?(E))* by 


R(e)Lfl= fe _f for all g € L%(E), f € L?(E). 


The Riesz Representation Theorem tells us that R is an isomorphism of L4(E) onto 
(L?(E))*. Let T be a bounded linear functional on (L?(£))*. We must show that there is a 
function f € L?(E) for which T = J( f ), that is, since R is onto, 


T(R(g)) = I(F)R(g)) = R(g)Lfl = [ g- f for all g € L9(B). (13) 


However, the composition T o R is a bounded linear functional on L4(E). The Riesz 
Representation Theorem, with p and q interchanged, tells us that there is a function 
f € L?(E) for which 


(ToR)[g| = [ Fete all g € L?(E). 
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Thus . a | | a 
r(R(e)) = (ToR)Iel= f f+ = a g- f for all g € L4(E), 


that is, (13) holds. LO 

In general, neither L'(£) nor L*(E) 1 is reflexive. Consider E = [0, 1]. Observe that 
L'[0, 1] is separable while (L![0, 1])* is not separable since it is isomorphic to L™[0, 1], 
which is not separable. We infer from Corollary 14 that L'[0, 1] is not reflexive. Observe 
that C[0, 1] is a closed subspace of L™[0, 1] (see Problem 27). By the preceding proposition, 
C[0, di is not reflexive and therefore Proposition 15 tells us that neither is L©[0, 1]. 


Remark Some care is needed when establishing reflexivity. R.C. James has given an example 
of a Banach space X that is isomorphic to X** but fails to be reflexive.” Reflexivity requires 
not just that X is isomorphic to X**: it requires that the natural embedding J of X into X*™* i 
an isomorphism. | | 


Remark The contrast between reflexivity for spaces of integrable functions and spaces of 
continuous functions is striking. The forthcoming Theorem 8 of Chapter 19 tells us that for 
1 < p< 0, the most general L? spaces are reflexive. On the other hand, if K is any compact 
Hausdorff space and C( K ) is normed by the maximum norm, then C(K) is reflexive if and 
only if K is a finite set (see Problem U of Chapter 15 ). 


PROBLEMS 


29. Show that a collection of bounded linear functions is equicontinuous if and only if it 1s 
uniformly bounded. 


30. Let X be a separable normed linear space. Show that its closed unit sphere S={xeX||lx] 
= 1} also is separable. 


31. Find a compact metric space X for which C(X), normed by the the maximum norm, is 
reflexive. | 


32. Let co be the subspace of 2 consisting of sequences that converge to 0. Show that co is a 
closed subspace of 2 whose dual space is isomorphic to £!. Conclude that co is not reflexive 
and therefore neither is 2™. 7 


33. For 1 < p < 00, show that the sequence space £? is reflexive if and only if 1 < p < oo. (For 
p = 00, see the preceding problem.) 


34. Consider the functional » € (C[—1, 1])* defined by 


0 t. ; 
n= | al h for all h € C[-1, 1]. 
J —] 0 


Show that w fails to take a maximum on the closed unit ball of C[—1, 1]. Use this to provide 
another proof that C[—1, 1] fails to be reflexive. 


35. For 1 < p < co, show that a bounded sequence in £? converges weakly if and only if it 
converges componentwise. 


aN non-reflexive Banach space isomorphic to its second dual,” Proc. Nat. Acad. Sci., 37, 1951. 
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36. For 1 < p < o and [a, b] a closed, bounded interval of real numbers, show that a bounded 
sequence {f,} in L?[a, b] converges weakly to f if and only if {f, f:} > Jf, f for every 
Lebesgue measurable subset E of [a, 5]. 

37. For [a, b] a closed, bounded interval of real numbers, show that if a sequence { f;,} in C[a, 5] 
converges weakly, then it converges pointwise. 


38. For X and Y normed linear spaces and an operator S € L(X, Y), define the adjoint of S, 
S* € L(Y*, X*) by 
[S*(w)](x) = W(S(x)) for all € Y*,x € X. 


(i) Show that ||S*|| = ||S|| and that S* is an isomorphism if S is an isomorphism. 


(i) For 1 < p<ooand X = L?(E), where E is a measurable set of real numbers, show that 
the natural embedding J: X + X** may be expressed as the composition 


where R, and R, are the Riesz representing operators. 


39. Let X be a reflexive Banach space and T: X — X a linear operator. Show that T belongs 
to £(X, X) if and only if whenever {x,} converges weakly to x, {T(x,)} converges weakly 
to T(x). 


14.4 LOCALLY CONVEX TOPOLOGICAL VECTOR SPACES 


There is a very nice class of topologies on a vector space X, topologies for which X is said 
to be a locally convex topological vector space, which, for our purposes, has two virtues: 
This class is large enough so that, for a normed linear space X, it includes both the strong 
topology on X induced by a norm and the weak topologies on X induced by any subspace 
W of X* that separates points. On the other hand, this class of topologies is small enough so 
that for linear spaces with these topologies, if K is a closed convex set that does not contain 
the point x9, there is a closed hyperplane passing through xo that contains no point of K. 
For two vectors u, v in a linear space X, a vector x that can be expressed as 


x=)hu+(1—A)vfor0<A<1 


is called a convex combination of u and v. A subset K of X is said to be convex provided it 
contains all convex combinations of vectors in K. Every linear subspace of a linear space is 
convex, and the open and closed balls in a normed space also are convex. 


Definition A locally convex topological vector space is a linear space X together with a 
Hausdorff topology that possesses the following properties: 


(t) Vector addition is continuous, that is, the map (x, y)>x + y is continuous from X X X 
to X; 

(i) Scalar multiplication is continuous, that is, the map (A, x) A -x is continuous from 
RX X to X; 


(iii) There is a base at the origin for the topology consisting of convex sets. 


For a normed linear space X, a subspace W of X* is said to separate points in X 
provided for each u, v € X, there is ab € W for which f(u )#y(v). Recall that for a subspace 
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W of X* and point x € X, a neighborhood base of x with respect to the W-weak topology 
comprises sets of the form 


geeey 


where € > 0 and each i; belongs to W. 


Proposition 21 Let X be a normed linear space. Then the linear space X is a locally convex 
topological vector space with respect to the topology induced by the norm and also with respect 
to the W-weak topology induced by any subspace W of X* that separates points in X. 


Proof First consider X with the topology induced by the norm. Since the topology is induced 
by a metric, it is Hausdorff. From the subadditivity and homogeneity of the norm we infer 
that vector addition and scalar multiplication are continuous. Finally, each open ball centered 
at the origin is convex and the collection of such balls is a base at the origin for the topology 
induced by the norm. 

Now let W be a subspace of X* that separates points. To show that the W-weak topology 
is Hausdorff, let u and v be distinct vectors in X. Since W separates points there is ay € W such 
that |y(u)—w(v)| = r>0. Then {x € X | |s(u)—W(x)|<r/2} and {x € X | |b(v)—(x)|<r/2} 
are disjoint W-weak neighborhoods of u and v, respectively. To show that vector addition is 


gore Wy NO RF EG Lag YE geeey 


reer Vy VL FJ Ef LW, Wn NS FEY Tees 


Thus vector addition is continuous at (x;, x2) € X X X. A similar argument shows that 
scalar multiplication is continuous. Finally, a basic neighborhood of the origin is of the form 
N eh... (0) and this set is convex since each yw; is linear. O 


goes 


Definition Let E be a subset of a linear space X. A point xo € E is said to be an internal point 
of E provided for each x € X, there is some Ag > 0 for which xp + A-x belongs to E if |A| < Apo. 


Proposition 22 Let X be a locally convex topological vector space. 
(i) A subset N of X is open if and only if for each xy € X and A#0, x9 +N and A-N 
are open. 
(ii) The closure of a convex subset of X is convex. 
(iii) Every point in an open subset O of X is an internal point of O. 


Proof We first verify (i). For xo € X, define the translation map T,,: X > X by T,,(x) = 
x + x9. Then T,, is continuous since vector addition is continuous. The map T_,, also is 
continuous and is the inverse of T,,. Therefore T,, is a homeomorphism of X onto X. Thus 
N is open if and only if A’ + x9 is open. The proof of invariance of the topology under 
nonzero scalar multiplication is similar. 


To verify (ii), let K be a convex subset of X. Fix A € [0, 1]. Define the mapping 
W: XXX —> X-by 


W(u, v) =Au+(1—A)vforallu,ve X. 
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Since scalar multiplication and vector addition are continuous, ¥: X X X > X is continuous, 
where X x X has the product topology. A continuous mapping maps the closure of a set into 
the closure of the image of the set. Thus ¥(K XK) C V(K X K). However, KX K = KXK. 
Moreover, since K is convex, V(K X K) C K. Therefore V(K x K) C K. Since this holds for 
all A € [0, 1], the closure of K is convex. 

To verify (iii), let x9 belong to O. Define g: RX X > X by g(A, x) =A-x+ x0. Since 
scalar multiplication is continuous, the mapping g is continuous. But g(0, 0) = x9 and O is 
a neighborhood of g(0, 0). Thus there is a neighborhood N, of 0 € R and a neighborhood 
No of 0 € X for which g(N1 X N2) C O. Choose Ag > 0 for which [—Ag, Ag] C Ny. Then 
xo +A-x belongs to E if |A] < Ao. U 


Proposition 23 Let X be a locally convex topological vector space and : X — R be linear. 
Then yf is continuous if and only if there is a neighborhood of the origin on which || is 
bounded, that is, there is a neighborhood of the origin, No, and an M > 0 for which 


ly] < Mon No. (14) 


Proof First suppose y is continuous. Then it is continuous at x = 0 and so, since (0) = 0, 
there is a neighborhood No of 0 such that |yW(x)| = |(x) — W(0)| <1 for x € No. Thus |y! 
is bounded on No. To prove the converse, let No be a neighborhood 0 and M > 0 be such 
that (14) holds. For each A > 0, A- No is also a neighborhood of 0 and || < A-M onA- No. 
To verify continuity of #: X — R, let xo belong to X and € > 0. Choose A so that A- M <e. 
Then xp + A- Nis a neighborhood of xo and if x belongs to x9 +A- No, then x — xo belongs 
to A- No so that - 


(x) — W(x0)| = W(x — x9)| SAM <e. 


For an infinite dimensional normed linear space X, Theorem 9 of the following chapter 
tells us that the weak topology on X is not metrizable. Therefore in an infinite dimensional 
Space, care is needed in using weak sequential convergence arguments to establish weak 
topological properties. 


Example (von Neumann) For each natural number n, let e, denote the sequence in 7 
whose nth component is 1 and other components vanish. Define 


E = {é, +n-ém | nand m any natural numbers, m >n}. 


We leave it as an exercise to show that 0 is a point of closure, with respect to the weak 
topology, of E but there is no sequence in F that converges weakly to 0. 


Remark Two metrics on a set X induce the same topology if and only if a sequence that is 
convergent with respect to one of the metrics is convergent with respect to the other. Things 
are quite different for locally convex topological vector spaces. There are linear spaces X that 
have distinct locally convex topologies with respect to which the convergence of sequences is 
the same. A classic example of this is the sequence space X = €'. The space X is a locally 
convex topological vector space with respect to the strong topology and with respect to the 
weak topology and these topologies are distinct (see Problem 6). However, a lemma of Schur 
asserts that a sequence converges weakly in ¢' if and only if it converges strongly in ¢! 3 


3See Robert E. Megginson’s An Introduction to Banach Space Theory [Meg98]. 
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Remark A topological vector space is defined to be a linear space with a Hausdorff topology 
with respect to which vector addition and scalar multiplication are continuous. In the absence 
of local convexity, such spaces can be rather pathological. For instance, if 0 < p <1, let X be 
the linear space of all Lebesgue measurable extended real-valued functions on {0, 1] for which 
| f|? is integrable over [0, 1] with respect to Lebesgue measure. Define 


p(f, 8) =| , |f — gl? dm forall f, g € X. 


[0, 


Then, after identifying functions that are equal almost everywhere, p is a metric that induces a 
Hausdorff topology on X with respect to which vector addition and scalar multiplication are 
continuous. But there are no continuous linear functionals on X besides the zero functional 
(see Problem 50). In the next section we show that there are lots of continuous linear functionals 
on a topological vector space that is locally convex. 


PROBLEMS 


40. Let X be a normed linear space and W be a subspace of X*. Show that the W-weak topology 
on X is Hausdorff if and only if W separates points in X. 


41. Let X be a normed linear space and w: X — R be linear. Show that y& is continuous 
with respect to the weak topology if and only if it is continuous with respect to the 
strong topology. 


42. Let X be a locally convex topological vector space and yw: X > R be linear. Show that w is 
continuous if and only if it is continuous at the origin. 


43. Let X be a locally convex topological vector space and yw: X > R be linear. Show that yw is 
continuous if and only if there is a neighborhood O of the origin for which f(O) #R. 


44. Let X be a normed linear space and W a subspace of X* that separates points. For any 
topological space Z, show that a mapping f: Z — X is continuous, where X has the W-weak 
topology, if and only if & o f: Z > R is continuous for all # € W. 


45. Show that the topology on a finite dimensional locally convex topological vector space is 
induced by a norm. 


46. Let X be a locally convex topological space. Show that the linear space X’ of all linear 
continuous functionals wy: X > R also has a topology with respect to which it is a locally 
convex topological space on which, for each x € X, the linear functional p> (x) is 
continuous. 


47. Let X and Y be locally convex topological vector spaces and T: X — Y be linear, one-to one, 
and onto. Show that T is a topological homeomorphism if and only if it maps base at the 
origin for the topology on X to a base at the origin for the topology on Y. 


48. Let X be alinear space and the function 7: X — [0, 00) have the following properties: for all 
u,v eX, (i)o(ut+v) <a(u) +oa(v); (i) o(u) = Oif and only if u = 0; (ii) o(u) = o(—u). 
Define p(u, v) = o(u — v). Show that p is a metric on X. | 


49. (Nikodym) Let X be the linear space of all measurable real-valued functions on [0, 1]. Define 


_ fl 
N=] roa for all f € X. 
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(i) Use Problem 48 to show that p(u, v) = o(u — v) defines a metric on X. 

(ul) Show that {f,} — f with respect to the metric p if and only if { f,} > f in measure. 
(iii) Show that (X, p) is a complete metric space. 

(iv) Show that the mapping ( f, g) > f + g is a continuous mapping of X x X into X. 
(v) Show that the mapping (A, f)+>A- g is a continuous mapping of R X X into X. 


(vi) Show that there are no nonzero continuous linear functionals won X. (Hint: Let 
Ww: X — R be linear and continuous. Show that there is an n such that p(f) = 0 
whenever f is the characteristic function of an interval of length less than 1/n. Hence 
w( f) = 0 for all step functions f.) 


50. (Day) For 0 < p<1, let X be the linear space of all measurable (with respect to the Lebesgue 
measure m) real-valued functions on [0, 1] for which | f|? is integrable. Define 


o(f)= | |f|? dm for all f € X. 
(0, 1] 


(i) Use Problem 48 to show that p(u, v) = o(u — v) defines a metric on X. 


(ii) Show that the linear space X, with the topology determined by p, is a topological vector 
space. 


(iii) For a nonzero function f in X and natural number 7, show that there is a partition 
O= x9 <x <...x, =1 of [0, 1] for which fr* f= 1/n- fo f, for alll <k <n. 


(iv) For a nonzero function f in X and natural number n, show that there are functions 
fi, .-+, fn for which p( fe, 0) <1/n for 1 <k <nand 


f=SUn- fy 
k=1 


(v) Show that there are no continuous nonzero linear functionals on X. 
51. Let S be the space of all sequences of real numbers, and define 
a(x) = S nl for all x = {x,} € S. 
2"[1 + |xnl] 
Prove the analogues of (i), (iii), (iv), and (v) of the preceding problem. What is the most 
general continuous linear functional on S? 
14.5 THE SEPARATION OF CONVEX SETS AND MAZUR’S THEOREM 


In the first section of this chapter we showed that a hyperplane in a linear space X is the level 
set of a nonzero linear functional on X. We therefore say that two nonempty subsets A and 
B of X may be separated by a hyperplane provided there is a linear functional wy: X > R 
and c € R for which 

w<conAandw>conB. 


Observe that if A is the singleton set {xo}, then this means precisely that 


(xo ) < inf p(x). 
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Definition Let K be a convex subset of a linear space X for which the origin is an internal 
point. The gauge functional’ for K, px: X — [0, oo), is defined by 


Dx(x) = inf {A>0| xé€A-K} forallx € X. 


Note it is precisely because the origin is an internal point of the convex set K that its gauge 
functional is finite. Also note that the gauge functional associated with the unit ball of a_ 
normed linear space is the norm itself. 


Proposition 24 Let K be a convex subset of a linear space X that contains the origin as 
an internal point and px the gauge functional for K. Then px is subadditive and positively 
homogeneous. 


Proof We establish subadditivity and leave the proof of positive homogeneity as an exercise. 
Let u,v € X and suppose, for A > 0 and pz > 0, that x € AK and y € pK. Then, since K is 
convex, 
1 A xX woo 

——— + (x+ + —— eK 

yan OMG dep 
Therefore, x + y € (A+ 4)K so that px(x+ y) < A+. Taking infima, first over all such A 
and then over all such yz, we conclude that px(x+ y) < px(x) + pr(y). O 


The Hyperplane Separation Lemma Let K; and K2 be two nonempty disjoint convex subsets 
of a linear space X, one of which has an internal point. Then there is anonzero linear functional 
wy: X — R for which 3 


sup (x) < inf (x). | (15) 


xeK, xeEK> 


Proof Let x; be an internal point of K, and x2 any point of K2. Define 
f£=x%2- x and K = K; +[—Ko] +z. 


Then K is a convex set that contains the origin as an internal point and does not contain z. 
Let p = px: X > R be the gauge functional for K. Define Y = span[z] and the linear 
functional ys: Y > R by (Az) = A. Thus y(z) = 1, and since 1 < p(z) because z ¢ K, 
we conclude that y < pon Y. According to-the preceding proposition, p is subadditive and 
positively homogeneous. Thus the Hahn-Banach Theorem tells us that ¢ may be extended 
to a linear functional on all of X so that 1 < ponall of X. Let x € Ky and y € K>. Then 
x—y+zeXK so that p(x — y+z) <1 and thus, since y is linear and  < pon all of X, 


W(x) —W(y) + H(z) =W(x—ytz) < p(x—ytz) <1. 
Since #(z) = 1, we have w(x) <w(y). This hélds for each'x'e Ky and y in Kp, so 


sup wt) < ink w(y). 


. xeK, ° | 
Of course, / #0 since W(z)=1. |: | | | ho 


4A gauge functional is often called a: Minkowski functional. 
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The Hyperplane Separation Theorem Let X be a locally convex topological vector space, 
K a nonempty closed convex subset of X, and xq a point in X that lies outside of K. Then K 
and xq may be separated by a closed hyperplane, that is, there is a continuous linear functional 
Ww: X — R for which 

W(x0) < inf p(x). (16) 


Proof Since K is closed, X~K is open. Choose a convex neighborhood NV of 0 for which 
KN [No + x0] = @. 


We may, possibly by replacing No with Mo N[—No], suppose No is symmetric with respect 
to the origin, that is, V9 = —No. By the Hyperplane Separation Lemma, there is a nonzero 
linear functional #: X — R for which 

sup w(x) < inf w(x). (17) 

xEN 9+x0 xeK 
Since y#0, we may choose z € X such that #(z) >0. According to Proposition 22, an interior 
point of a set is an internal point. Choose A > 0 such that A-z € No. Since A(z) > 0 and 
Az + x9 € No + x0, we infer from the linearity of and inequality (17) that 
W(x0) <Ab(z) + W(x0) = W(Az+x0) < sup p(x) < inf (x). 
xEN +x xeK 

It remains to show that # is continuous. Define M = [inf ,<x(x)] — w(x). We infer from 
(17) that » < M on No. Since No is symmetric, || < M on No. By Proposition 23, w is 
continuous. LI 


Corollary 25 Let X be a normed linear space, K a nonempty strongly closed convex subset 
of X, and xq a point in X that lies outside of K. Then there is a functional  € X* for which 


W(x0) < inf o(2). (18) 


Proof According to Theorem 21, the linear space X is a locally convex topological vector 
space with respect to the strong topology. The conclusion now follows from the Hyperplane 
Separation Theorem. 


Corollary 26 Let X be a normed linear space and W a subspace of its dual space X* that 
separates points in X. Furthermore, let K be a nonempty W-weakly closed convex subset of X 
and xq a point in X that lies outside of K. Then there is a functional  € W for which 


(20) < inf w(x), (19) 


Proof According to Theorem 21, the linear space X is a locally convex topological vector 
space with respect to the W-weak topology. Corollary 5 tells us that the W-weakly continuous 
linear functionals on X belong to W. The conclusion now follows from the Hyperplane 
Separation Theorem. 


Mazur’s Theorem Let K be a convex subset of a normed linear space X. Then K is strongly 
closed if and only if it is weakly closed. 
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Proof Since each y € X* is continuous with respect to the strong topology, each weakly 
open set is strongly open and therefore each weakly closed set is strongly closed, irrespective 
of any convexity assumption. Now suppose K is nonempty, strongly closed, and convex. Let 
xo belong to X~K. By Corollary 25, there is a y € X* for which 


(x9) <a@ = inf p(x). 


Then {x € X | (x) < a} is a weak neighborhood of xp that is disjoint from K. Thus X~K is 
weakly open and therefore its complement in X, K, is weakly closed. U 


Corollary 27 Let K be a strongly closed convex subset of a normed linear space X. Suppose 
{x,} is a sequence in K that converges weakly to x € X. Then x belongs to K. 


Proof The weak limit of a sequence in K is a point of closure of K with respect to the weak 
topology. Therefore x belongs to the weak closure of K. But Mazur’s Theorem tells us that 
the weak closure of K is K itself. LJ 


Theorem 28 Let X be a reflexive Banach space. Then each strongly closed bounded convex 
subset of X is weakly sequentially compact. 


Proof Theorem 17 tells us that every bounded sequence in X has a weakly convergent 
subsequence. Therefore, by the preceding corollary, every sequence in K has a subsequence 
that converges weakly to a point in K. : LJ 


The following is a variation of the Banach-Saks Theorem; the conclusion is weaker, 
but it holds for general normed linear spaces. 


Theorem 29 Let X be a normed linear space and {xn} a sequence X that converges weakly 
to x € X. Then there is a sequence {z,} that converges strongly to x and each Zp is a convex 
combination of {xn, Xn+1, ..-}. 


Proof We argue by contradiction. If the conclusion is false, then there is a natural number 
n and an «€ > 0 for which, if we define Ko to be the set of all convex combinations of 
{Xn Xn41, ---}, then 
|x — z|| > € for all z € Ko. 

Define K to be the strong closure of Ky. Then x does not belong to K. The strong closure 
of a convex set is convex. Moreover, K is convex since Ko is convex. Therefore, by Mazur’s 
Theorem, K is weakly closed. Since {x,} converges to x with respect to the weak topology, 
x is a point of closure of K with respect to the weak topology. But a point of closure of a 
closed set belongs to the set. This contradiction concludes the proof. LJ 


The following theorem is a generalization of Corollary 18. 


Theorem 30 Let K be astrongly closed bounded convex subset of a reflexive Banach space X. 
Let the function f : K — R be continuous with respect to the strong topology on K and convex 
in the sense that foru,v € K and0 <A <1, 


f(Au+(1—A)v) SAf(u) + (1 -A)F(v). 


If f is bounded below on K, then f takes a minimum value on K. 
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Proof Define m to be the infimum of f(K). Choose a sequence {x,} in K such that { f(x, )} 
converges to m. According to Theorem 28, K is weakly sequentially compact. We may 
assume that {x,} converges weakly to x € K. Let « > 0. Choose a natural number NV 
such that 

m < f(x.) <m+eforallk >N. (20) 


Theorem 29 tells us that there is a sequence {z,} that converges strongly to x and each z, 
is a convex combination of {x7, X41, ...}. By the continuity of f with respect to the strong 
topology on K, {f(Zn)}—> f(x). On the other hand, from the convexity of f and (20), 


m < f(zn)<m-+e foralln > N. 


Therefore m < f(x) < m+. This holds for all « > 0 and hence f takes its minimum value 
on K at the point x. LJ 


PROBLEMS 


52. For each natural number n, let ¢, denote the sequence in £7 whose nth component is 1 
and other components vanish. Define E = {én +n -@m|n and m any natural numbers, m > n}. 
Show that 0 is a point of closure of E but no sequence in E converges weakly to 0. Consider 
the topological space X = EU{0} with the weak topology. Find a function f: X > R that fails 
to be continuous at 0 and yet has the property that whenever a sequence {x,} in E converges 
weakly to 0, its image sequence { f(x, )} converges to f(0). 


53. Find a subset of the plane R2 for which the origin is an internal point but not an interior point. 


54. Let X be a locally convex topological vector space and V a convex, symmetric with respect 
to the origin (that is, V = —V) neighborhood of the origin. If py is the gauge functional 
for V and yf is a linear real-valued functional on X such that Ww < py on X, show that w is 
continuous. 


55. Let X be a locally convex topological vector space, Y a closed subspace of X, and xo belong 
to X~Y. Show that there is a continuous functional : X — R such that 


(x0) #0 andy =OonyY. 


+6. Let X be a normed linear space and W a proper subspace of X* that separates points. Let 
belong to X*~W. Show that ker is strongly closed and convex but not W-weakly closed. 
(Hint: Otherwise, apply Corollary 26 with K = ker w.) 


57. Let X be a normed linear space. Show that the closed unit ball B* of X* is weak-« closed. 


58. Show that the Hyperplane Separation Theorem may be amended as follows: the point x) may 
be replaced by a convex set Ko that is disjoint from K and the conclusion is that K and Ko 
can be separated by a closed hyperplane if Ko is either compact or open. 


59. Show that the weak topology on an infinite dimensional normed linear space is not first 
countable. 


60. Show that every weakly compact subset of a normed linear space is bounded with respect to 
the norm. . 
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61. Let Y be a closed subspace of a reflexive Banach space X. For xy € X~Y, show that there is a 
point in Y that is closest to x9. 
62. Let X be normed linear space, W a finite dimensional subspace of X* and yw a functional in 


X*~W. Show that there is a vector x € X such that ys(x) #0 while g(x) = 0 for all g in W. 
(Hint: First show this is true if X is finite dimensional.) 


63. Let X be a normed linear space. Show that any dense subset of B* = { € xt lel] < 1} 
separates points in X. 


64. Complete the final part of the proof of Theorem 11. 


65. Find an example of a bounded subset A of a normed linear space X, F a set of functionals 
in X* containing Fy as a dense subset of F (dense in the sense of the norm topology on X*) 
such that ¥ and Fo generate different weak topologies for X, but the same weak topology 
for A. 


14.6 THE KREIN-MILMAN THEOREM 


Definition Let K be a nonempty convex subset 0 f a locally convex topological vector space X. 

A nonempty subset E of K is called an extreme subset? of K provided it is both convex and 
closed and whenever a vector x € E is a convex combination of vectors u and vin K, then both 
u and v belong to E. A point x € K is called an extreme point of K provided the singleton set 
{x} is an extreme subset of K. 


We leave it as two exercises to show that if the intersection of a collection of extreme 
subsets of K is nonempty, then the intersection is an extreme subset of K and, moreover, if A 
is an extreme subset of B and B is an extreme subset of K, then A is an extreme subset of K. 


Lemma 31 Let K be a nonempty, compact, convex subset of a locally convex topological 
vector space X and : X — R be linear and continuous. Then the set of points in K at which 
W takes its maximum value on K is an extreme subset of K. 


Proof Since K is s compact and w is continuous, y takes a maximum value, m, on K. The 
subset M of K on which y takes its maximum value is closed, since is continuous, and is 
convex since if is linear. Let x € M be a convex combination of vectors u, v in K. Choose 
0 <A <1 for which x = Au + (1 — A)v. Since 


W(u) <m,y(v) < mand m = (x) = Ap(u) + (1— AyW(v), 
we must have W(u) = W(v) =m, that is, u, ve M. | | Oo 


The Krein-Milman Lemma Let K be a nonempty, compact, convex subset of a locally convex 
topological vector space X. Then K has an extreme point. 


Proof The strategy of the proof is first to apply Zorn’s Lemma to find an extreme subset E 
of K that contains no proper subset which also is an extreme subset of K. We then infer from 
the Hyperplane Separation Theorem and the preceding lemma that E is a singleton set. 


Consider the collection ¥ of extreme subsets of K. Then F is nonempty since it 
contains K. We order ¥ by containment. Let Fo C F be totally ordered. Then Fo has the — 


> An extreme subset of K is also often called a supporting set for K. | 
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finite intersection property since for any finite subcollection of Fo, because Fo is totally 
ordered, one of the subsets of this finite subcollection is contained in all the others and 
therefore the intersection is nonempty. Thus Fo is a collection of nonempty closed subsets 
of the compact set K which has the finite intersection property. Hence if we let Ep be the 
intersection of the sets in Fp, Ep is nonempty. As already observed, Ep is an extreme subset 
of K since it is the nonempty intersection of such sets. Thus Eo is a lower bound for Fo. 
Therefore every totally ordered subcollection of F¥ has a lower bound and hence, by Zorn’s 
Lemma, F has a minimal member, that is, there is an extreme subset E of K which contains 
no proper extreme subset. 

We claim that E is a singleton set. Indeed, otherwise we may select two points u and v in 
K. It follows from the Hahn-Banach Theorem that there is ay € X* for which #(u) <(v). 
According to Lemma 3t, the subset M of E on which w takes its maximum value on E is 
an extreme subset of E. Since E is an extreme subset of K, M is also an extreme subset of 
K. Clearly u ¢ M, and therefore M is a proper subset of E. This contradicts the minimality 
of E. Thus E is a singleton set and therefore K has an extreme point. L 


Definition Let K be a subset of a locally convex topological vector space X. Then the closed 
convex hull of K is defined to be the intersection of all closed convex subsets of X that 
contain K. 


We infer from Mazur’s Theorem that in a normed linear space, the weakly closed 
convex hull of a set equals its strongly closed convex hull. It is clear that the closed convex 
hull of a set K is a closed convex set that contains K and that is contained in any other closed 
convex set that contains K. 


The Krein-Milman Theorem Let K be a nonempty, compact, convex subset of a locally 
convex topological vector space X. Then K is the closed convex hull of its extreme points. 


Proof By the Krein-Milman Lemma, the set E of extreme points of K is nonempty. Let C 
be the closed convex hull of E. If K #C, choose x9 € K~C. By the Hyperplane Separation 
Theorem, since C is convex and closed, there is a continuous linear functional »: X > R 
such that 


(x9) > max (x) > maxy(x). (21) 


By Lemma 31, if m is the maximum value taken by # on K, then M = {x € K|#(x) =m} is 
an extreme subset of K. By the Krein-Milman Lemma, applied now with K replaced by the 
nonempty compact convex set M, there is a point z € M that is an extreme point of M. As we 
already observed, an extreme point of an extreme subset of K is also an extreme point of K. 
We infer from (21) that #(z) > (x9) > #(z). This contradiction shows that K = C. CJ 


There are many interesting applications of the Krein-Milman Theorem.° In Chapter 22, 
this theorem is used to prove the existence of ergodic measure preserving transformations. 
Louis de Branges has used this theorem to provide an elegant proof of the Stone-Weirstrass 
Theorem (see Problem 53 of Chapter 21). 


®See Peter Lax’s Functional Analysis [Lax97]. 
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Remark To apply the Krein-Milman Theorem it is necessary to establish criteria for identifying 
which subsets of a locally convex topological space are compact. In particular, to identify 
which convex subsets of a normed linear space X are weakly compact and which convex 
subsets of its dual space X* are weak-« compact. In the next chapter we prove a theorem of 
Alaoglu, which tells us that the closed unit ball of the dual of a normed linear space X is 
weak-* compact. Therefore every bounded convex subset of X* that is weak-* closed is weak-x 
compact. From Alaoglu’s Theorem and Mazur’s Theorem we then infer that any strongly 
closed bounded convex subset of a reflexive Banach space is weakly compact. 


| PROBLEMS 
66. Find the yen points of each of the following subsets of the plane R?: 
(i) {(x, y) |x? +y° = 1; (ii) ((x, y) Lal + Lvl = 35 (iii) (x, y) | max{x, y} = 1). 
67. In each of the following, B denotes the closed unit ball of a normed linear space X. 
(i) Show that the only possible extreme points of B have norm 1. 
(ii) If X = L?[a, b], 1 < p < oo, show that every unit vector in B is an extreme point of B. 


(iii) If X = L™[a, b], show that the extreme points of B are those functions f € B such that 
|f| =1a.e. on [a, 5]. 


(iv). If X = L'[a, b], show that B fails to have any extreme points. 
(v) If X =1?,1 < p< 0, what are the extreme points of B? 


(vi) If X = C(K), where K is a compact Hausdorff topological space and X is normed by the 
maximum norm, what are the extreme points of B? 


68. A norm ona linear space is said to be strictly convex provided whenever u and v are distinct 
unit vectors and 0 < A <1, then ||Aw + (1 — A)v|| < 1. Show that the Euclidean norm on R” 
and the usual norm on L?[a, b], 1 < p < o are strictly convex. 


69. Let X be a reflexive Banach space with a strictly convex norm and K a nonempty closed 
convex subset of X. For z € X~K, use the reflexivity of X to show that there is a point x9 € K 
that is closest to z in the sense that 


lz — xoll < lx — zl for all x € K. 


Then use the strict convexity of the norm to show that xo is unique and is an extreme point 
of K. 


CHAPTER = 15 


Compactness Regained: 
The Weak Topology 
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We proved a theorem of Riesz which asserts that the closed unit ball of an infinite dimensional 
normed linear space fails to be compact with respect to the strong topology induced by 
the norm. In this chapter we prove a precise theorem regarding the manner in which, for 
an infinite dimensional Banach space, compactness of the closed unit ball is regained with 
respect to the weak topology. We prove that if B is the closed unit ball of a Banach space X, 
then the following are equivalent: 


(i) X is reflexive; 
(ii) Bis weakly compact; 
(iii) Bis weakly sequentially compact. 


The first compactness result we establish is Alaoglu’s Theorem, an extension of Helley’s 
Theorem to non-separable spaces, which tells us that for a normed linear space X, the closed 
unit ball of the dual space X* is compact with respect to the weak-« topology. This direct 
consequence of the Tychonoff Product Theorem enables us to use the natural embedding 
of a Banach space in its bidual, J: X — X**, to prove the equivalences (i)-(ii). What is 
rather surprising is that, for the weak topology on B, sequential compactness is equivalent 
to compactness despite the fact that in general the weak topology on B is not metrizable. 


15.1 ALAOGLU’S EXTENSION OF HELLEY'S THEOREM 
Let X be a normed linear space, B its closed unit ball and B* the closed unit ball of its dual 
space X*. Assume X is separable. Choose {x,} to be a dense subset of B and define 


OO 


1 
pb, 0) = 2 ae —0)(an)I for all p, 1 € BY. 


Then p is a metric that induces the weak-« topology on B* (see Corollary 11). For a metric 
space, compactness is the same as sequential compactness. Therefore Helley’s Theorem may 
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be restated as follows: If X is a separable normed linear space, then the closed unit ball B* of 
its dual space X* is weak-« compact. We now use the Tychonoff Product Theorem to show 
that the separability assumption is not needed. 

Recall a special case of the Tychonoff Product Theorem: Let A be any set. Consider 
the collection F(A) of all real-valued functions on A that take values in the closed, bounded 
interval [—1, 1]. Consider F(A) as a topological space with the product topology. A base at 
f € F(A) for the product topology on F(A) comprise sets of the form 


where € > 0 and the A;’s belong to A. The Tychonoff Product Theorem implies that the 
topological space consisting of F(A) with the product topology is compact. Therefore every 
closed subset Fo(A) of F(A), with the topology induced by the product topology, also is 
compact. 


Alaoglu’s Theorem Let X be a normed linear space. Then the closed unit ball B* of its dual 
space X* is compact with respect to the weak-* topology. 


Proof Denote the closed unit balls in X and X* by B and B*, respectively. By the preceding 
discussion, the topological space F(B) consisting of functions from B to [—1, 1], with the 
product topology, is compact. 


Define the restriction map R: B* > F(B) by R(w) = lg for € B*. We claim 
that (i) R(B*) is a closed subset of ¥(B) and (ii) the restriction map R is a topological 
homeomorphism from B*, with the weak-+ topology, onto R( B* ), with the product topology. 
Suppose, for the moment, that (i) and (ii) have been established. By the preceding discussion, 
the Tychonoff Product Theorem tells us that R(B*) is compact. Therefore any space 
topologically homeomorphic to R(B*) is compact. In particular, by (ii), B* is weak-« 
compact. 

It remains to verify (i) and (ii). First observe that R is one-to-one since for y, 7 € B*, 
with ¢ #7, there is some x € B for which w(x) #7(x) and thus R(y) # R(n). A direct 
comparison of the basic open sets in the weak-« topology with basic open sets in the product 
topology reveals that R is a homeomorphism of B* onto R(B*). It remains to show that 
R( B*) is closed with respect to the product topology. Let f: B — [—1, 1] be a point of 
closure, with respect to the product topology, of R( B*). To show that f € R(B*) it suffices 
(see Problem 1) to show that for all u, v € Band A € R for whichu-+v and Au also belong to B, 


flut+v) = f(u) + f(v) and f(Au) =Af(u). (1) 


However, for any € > 0, the weak-* neighborhood of f, Nc.u,v,u+v( f), contains some R(, ) 
and since ¢ is linear, we have | f(u + v) — f(u) — f(v)| <3e. Therefore, the first equality 
in (1) holds. The proof of the second is similar. O 


Corollary 1 Let X be anormed linear space. Then there is a compact Hausdorff space K for 
which X is linearly isomorphic to a linear subspace of C(K), normed by the maximum norm. 


Proof Let K be the closed unit ball of the dual space, with the weak-« topology. Alaoglu’s 
Theorem tells us that K is compact and it certainly is Hausdorff. Define ®: X > C(K) by 
®(x) = J(x)|x. Since the natural embedding J: X > X** is an isometry, so is ®. C 
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Corollary 2 Let X be a normed linear space. Then the closed unit ball B* of its dual space X* 
possesses an extreme point. 


Proof We consider X* as a locally convex topological space with its weak-« topology. 
According to Alaoglu’s Theorem, B* is convex and compact. The Krein-Milman Lemma 
tells us that B* possesses an extreme point. O 


Remark Alaoglu’s Theorem does not tell us that the closed unit ball of the dual of a normed 
linear space is sequentially compact with respect to the weak-* topology. For instance, for 
X = €™, B*, the closed unit ball of X*, is not weak-« sequentially compact. Indeed, the 
sequence {is,} C B* defined for eachn by 


Wn ({xx}) = Xx» for all {x;,} € | 


fails to have a weak-« convergent subsequence. Alaoglu’s Theorem is a generalization of 
Helley’s Theorem from the viewpoint of compactness, not sequential compactness. By Helley’s 
Theorem, B* is weak-« sequentially compact if X is separable, and the forthcoming Corollary 
6 tells us that B* also is weak-« sequentially compact if X is reflexive. 


PROBLEMS 


1. For X a normed linear space with closed unit ball B, suppose the function f: B > 
[—1, 1] has the property that whenever u,v, u + v, and Au belong to B, f(u+v) = f(u) + 
f(v) and f(Au) = Af(u). Show that f is the restriction to B of a linear functional on all of 
X which belongs to the closed unit ball of X*. 


2. Let X be a normed linear space and K be a bounded convex weak-« closed subset of X*. 
Show that K possesses an extreme point. 


3. Show that any nonempty weakly open set in an infinite dimensional normed linear space is 
unbounded with respect to the norm. 


4. Use the Baire Category Theorem and the preceding problem to show that the weak topology 
on an infinite dimensional Banach space is not metrizable by a complete metric. 


5. Is every Banach space isomorphic to the dual of a Banach space? 
15.2 REFLEXIVITY AND WEAK COMPACTNESS: KAKUTANI’S THEOREM 


Proposition 3. Let X be a normed linear space. Then the natural embedding J: X > X** 
is a topological homeomorphism between the locally convex topological vector spaces X and 
J(X), where X has the weak topology and J( X) has the weak-x topology. 


Proof Let xo belong to X. A neighborhood base for the weak topology at xp € X is defined 
by sets of the form, for « > 0 and Wj, ..., B,_, € X*, 


geeeg 


Now [J(x) — J(xo) |W = W(x — x9) for each x € X and 1 <i <n, and therefore 


HN ebay (20)) = I(X)O Ned... (J (%0)). 
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Therefore J maps a base for the weak topology at the origin in X onto a base for the weak- 
topology at the origin in J( X ). Thus J is ahomeomorphism from X, with the weak topology, 
onto J(X), with the weak-« topology. O 


Kakutani’s Theorem <A Banach space is reflexive if and only if its closed unit ball is weakly 
compact. 


Proof Let X be a Banach space. Denote the closed unit balls in X and X** by B and B**, 
respectively. Assume X is reflexive. The natural embedding is an isomorphism and therefore 
J is a one-to-one map of B onto B**. On the other hand, according to Proposition 3, J is 
a homeomorphism from B, with the weak topology, onto B**, with the weak-* topology. 
But by Alaoglu’s Theorem, applied with X replaced by X*, B** is weak-* compact, so any 
topological space homeomorphic to it also is compact. In particular, B is weakly compact. 


Now assume B is weakly compact. The continuous image of compact topological 
Spaces is compact. We infer from Proposition 3 that J(B) is compact with respect to the 
weak-x topology. Of course, J(B) is convex. To establish the reflexivity of X, we argue by 
contradiction. Assume X is not reflexive. Let T belong to B**~J(B). Apply Corollary 26 of 
the Hyperplane Separation Theorem in the case that X is replaced by X* and W = J( X*). 
Thus there is a functonal & € X* for which ||y|| = 1 and 


T(w) <inf ses(B)S(#) = inf ,<pip(x). 


The right-hand infimum equals —1, since ||¥|| = 1. Therefore T(y) < —1. This is a 
contradiction since ||T|| < 1 and ||#|| = 1. Therefore X is reflexive. LJ 


Corollary 4 Every closed, bounded, convex subset of a reflexive Banach space is weakly 
compact. 


Proof Let X be a Banach space. According to Kakutani’s Theorem, the closed unit ball of 
X is weakly compact. Hence so is any closed ball. According to Mazur’s Theorem, every 
closed, convex subset of X is weakly closed. Therefore any closed, convex, bounded subset 
of X is a weakly closed subset of a weakly compact set and hence must be weakly compact. LJ 


Corollary 5 Let X be a reflexive Banach space. Then the closed unit ball of its dual space, B*, 
is sequentially compact with respect to the weak-* topology. 


Proof Since X is reflexive, the weak topology on B* is the same as the weak-x topology. 
Therefore, by Alaoglu’s Theorem, B* is weakly compact. We infer from Kakutani’s Theorem 
that X* is reflexive. We therefore infer from Theorem 17 of the preceding chapter that every 
bounded sequence in X* has a weak-* convergent subsequence. But B* is weak-* closed. 
Thus B* is sequentially compact with respect to the weak-x topology. LJ 


PROBLEMS 


6. Show that every weakly compact subset of a normed linear space is bounded with respect to 
the norm. 


7. Show that the closed unit ball B* of the dual X* of a Banach space X has an extreme point. 
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8. Let 7, and Tz be two compact, Hausdorff topologies on a set S for which 7; C T2. Show 
that 7, = 7. 
9. Let X be a normed linear space containing the subspace Y. For A C Y, show that the weak 


topology on A induced by Y* is the same as the topology A inherits as a subspace of X with 
its weak topology. 


10. Argue as follows to show that a Banach space X is reflexive if and only if its dual space X* is 
reflexive. 
(i) If X is reflexive, show that the weak and weak-x topologies on B* are the same, and infer 
from this that X* is reflexive. 


(ii) If X* is reflexive, use Problem 8 to show that the weak and weak-« topologies on B* are 
the same, and infer from this and Proposition 6 of Chapter 14 that J(X) = X**. 


11. For X a Banach space, by the preceding problem, if X is reflexive, then so in X*. Conclude 
that X is not reflexive if there is a closed subspace of X* that is not reflexive. Let K be an 
infinite compact Hausdorff space and {x,} an enumeration of a countably infinite subset of 
K. Define the operator T : ' — [C(K)]* by 


[T((md IF) = Se (xe) for all (ng) € 1! and f € C(k). 
k=1 


Show that T is an isometry and therefore, since /' is not reflexive, neither is T(J!) and 
therefore neither is C( K). Use a dimension counting argument to show that C( K) is reflexive 
if K is a finite set. 


12. If Y is a linear subspace of a Banach space X, we define the annihilator Y+ to be the subspace 
of X* consisting of those y € X* for which # = 0 on Y. If ¥ is a subspace of X*, we define Y° 
to be the subspace of vectors in X for which w(x) = 0 for all w € Y. 
(i) Show that Y+ is a closed linear subspace of X*. 


(ii) Show that (¥+)° = Y. 
(iii) If X is reflexive and Y is subspace of X*, show that Yt = J(Y°). 


15.3. COMPACT NESS AND WEAK SEQUENTIAL COMPACTNESS: 
THE EBERLEIN-SMULIAN THEOREM 


Theorem 6 (Goldstine’s Theorem) Let X be a normed linear space, B the closed unit ball of 
X, and B** the closed unit ball of X**. Then the the weak-« closure of J(B) is B**. 


Proof According to Corollary 9 of the preceding chapter, J is an isometry. Thus J(B) C B**. 
Let C be the weak-x closure of J(B). We leave it as an exercise to show that B** is weak-« 
closed. Thus C C B**. Since B is convex and J is linear, J(B) is convex. Proposition 22 
of the preceding chapter tells us that, in a locally convex topological vector space, the 
closure of a convex set is convex. Thus C is a convex set that is closed with respect to the 
weak-« topology. Suppose C # B**. Let T belong to B**~C. We now invoke the Hyperplane 
Separation Theorem in the case that X is replaced by (X*)* and (X*)* is considered as a 
locally convex topological vector space with the weak-« topology; see Corollary 26 of the 
preceding chapter. Thus there is some  € X* for which ||y|| = 1 and 


T(W) <inf secS(p). (2) 
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Observe that since C contains J(B), 


inf secS() < inf yep W(x) = -1. 


Therefore T T() < —1. This is a contradiction since 7] < 1 and |p| = 1. Therefore 
C= J(B) and the proof of is complete. 0 


Lemma 7 Let X be a normed linear space and W a finite dimensional subspace of X*. Then 
there is a finite subset F of X for which 


W/2.< maxy(x) for all y € W. 3) 


Proof Since W is finite dimensional, its closed unit sphere S* = {h € W||l¥|| = 1} is 
compact and therefore is totally bounded. Choose a finite subset {u1, ..., Wn} of S* for which 
S* C Ul_, B(w, 1/4). For 1 < k <n, choose a unit vector x; in X for which We ( xz) > 3/4. 
Let & belong to S*. Observe that 


(xk) = Were) + Lb — dalxe > 3/4 + [yb — delax for 1 < k <n. 


If we choose k such that || — w,|l < 1/4, then since ||xz|| = 1, &(x,) = 1/2 = 1/2]. Thus 
(3) holds if F = {xj,..., xz} and w € W has ||| = 1. It therefore holds for all y € W. L] 


Theorem 8 (the Eberlein-Smulian Theorem) Let B be the closed unit ball of a Banach space 
X. Then B is weakly compact if and only if it is weakly sequentially compact. 


Proof We first assume B is compact. Kakutani’s Theorem tells us that X is reflexive. 
According to Theorem 17 of the preceding chapter, every bounded sequence in X has a 
weakly convergent subsequence. Since B is weakly closed, B is weakly sequentially compact. © 


To prove the converse, assume B is weakly sequentially compact. To show that B is 
compact it suffices, by Kakutani’s Theorem, to show that X is reflexive. ! Let T belong to 
B**. Goldstine’s Theorem tells us that T belongs to the weak-x closure of J( B). We will use 
the preceding lemma to show that T belongs to J( B). 


Choose # € B*. Since T belongs to the weak-« closure of J(B), we may choose 
x € B for which J(x;) belongs to N',y,,(T). Define N(1) = 1 and W; = span{{T, J(x1)}] C 
X**, Let n be a natural number for which there has been defined a natural number 
N(n), a subset {xx}i<e<n of B, a subset {x}1<p<n(n) of X* and we have defined W, = 
span[{T, J(x,), ..., J(%)}]. Since T belongs to the weak-« closure of J(B), we may 
choose x,41 € B for which 


J(Xn41) € N4/(n4h) hy ntom(ny (2) | (4) 


Define | 
Wr+i = Span{{T, J(x1), ..., J(%n41)}]- (5) 


IThis elegant proof that sequential compactness of the closed unit ball implies reflexivity is due to R. J. Whitley, 
“An elementary proof of the Eberlein-Smulian Theorem,” Mathematische Annalen, 1967. 
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We infer from the preceding lemma, in the case X is replaced by X*, that there is a natural 
number N(n + 1) > N(n) and a finite subset {Wi} v(n)<k<n(n41) Of X* for which 
SI/2< max S for all S € W,,41. 6 
ISIl/ w(n) oe nt) (Wx) +1 (6) 
We have therefore inductively defined a strictly increasing sequence of natural numbers 
{N(n)}, a sequence {x,} in B, a sequence {w,,} in X*, and a sequence {W,,} of subspaces of 


X** for which (4) and (6) hold. Since {W,,} is an ascending sequence for which (6) holds for 
every index n, 


|SI/2 < sup y<p<ooS(W ) for all Se W=span[{T, J(x1), ..., I(x), ---}]. (7) 
Since (4) holds for all n, 
\(T — J(Xm))[Wall <1/m ifn <N(m), 8) 


Since B is sequentially compact, there is a subsequence {x,,} of {x,} that converges weakly 
to x € B. Mazur’s Theorem tells us that a sequence of convex combinations of the terms of 
the sequence {x,,} converges strongly to x. The image under J of this sequence of convex 
combinations converges strongly to J(x) in X**. Thus J(x) belongs to W. But T also belongs 
to W. Therefore T — J(x) belongs to W. We claim that T = J(x). In view of (7) to verify 
this claim it is necessary and sufficient to show that 


(T — J(x))[W,] = 0 for all n. (9) 
Fix a natural number n. Observe that for each index k, 


(T — J(x) ln] = (T — FC tng) [Yn] + (Fm) — F(x) al: 
We infer from (8) that if N(nz) >n, then |[(T — J(xn, ))[Wn]| < 1/m,. On the other hand, 


(J 0%.) — F(x) in] = Yn (Xn, — x) for all k 


and {x,,} converges weakly to x. Thus 


(T —J(x) bn] = im (7 — Fam) Ln] + Lim (F(%ng) — F(a) [Yn] = 0. Z 


We gather Kakutani’s Theorem and the Eberlein-Smulian Theorem into the following 
statement. 


Characterization of Weak Compactness Let B be the closed unit ball of a Banach space X. 
Then the following three assertions are equivalent: 
(i) X is reflexive; 
(ii) Bis weakly compact; 
(iii) Bis weakly sequentially compact. 
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PROBLEMS 
13. In a general topological space that is not metrizable a sequence may converge to more than 
one point. Show that this cannot occur for the W-weak topology on a normed linear space X, 
where W is a subspace of X* that separates points in X. 


14. Show that there is a bounded sequence in L™{0, 1] that fails to have a weakly convergent 
subsequence. Show that the closed unit ball of C[a, b] is not weakly compact. 


15. Let K be a compact, metric space with infinitely many points. Show that there is a bounded 
sequence in C( K ) that fails to have a weakly convergent subsequence (see Problem 11), but 
every bounded sequence of continuous linear functionals on C(K) has a subsequence that 
converges pointwise to a continuous linear functional on C( K). 


15.4 METRIZABILITY OF WEAK TOPOLOGIES 


If the weak topology on the closed unit ball of a Banach space is metrizable, then the 
Eberlein-Smulian Theorem is an immediate consequence of the equivalence of compactness 
and sequential compactness for a metric space. To better appreciate this theorem, we now 
establish some metrizable properties of weak topologies. The first theorem presents a good 
reason why analysts should not just stick with metric spaces. 


Theorem 9 Let X be an infinite dimensional normed linear space. Then neither the weak 
topology on X nor the weak-x topology on X* is metrizable. 


Proof To show that the weak topology on X is not metrizable, we argue by contradiction. 
Otherwise, there is a metric p: X X X — [0, 00) that induces the weak topology on X. Fix a 
natural number n. Consider the weak neighborhood {x € X | p(x, 0) <1/n} of 0. We may 
choose a finite subset F,, of X* and €, > 0 for which 


{xe X]| |W(x)| <e, for alle F,}C {xe X | p(x, 0) <1/n}. 
Define W,, to be the linear span of F,,. Then 
Nyew, ker C {xe X| p(x, 0) < 1/n}. (10) 


Since X is infinite dimensional, it follows from the Hahn-Banach Theorem that X* also is 
infinite dimensional. Choose , €¢ X*~W,,. We infer from Proposition 4 of the preceding 
chapter that there is an x, € X for which yf, (x, ) #0 while (x, ) = 0 for all wy € F,. Define 
Un = N- Un/|\up||. Observe that ||u,,|| =n and, by (10), that p(u,, 0) < 1/n. Therefore {u,} 
is an unbounded sequence in X that converges weakly to 0. This contradicts Theorem 12 of 
the preceding chapter. Therefore the weak topology is not metrizable. 


To prove that the weak-« topology on X* is not metrizable, we once more argue 
by contradiction. Otherwise, there is a metric p*: X* X X* —> [0, 00) that induces the 
weak-« topology on X*. Fix a natural number n. Consider the weak-*« neighborhood 
{y € X*| p*(y, 0) < 1/n} of 0. We may choose a finite subset A, of X and €, > 0 for which 


{ye X* | |W(x)| <e, for all x € An} C {yp © X*| p*(W, 0) <1/n}. 
Define X,, to be the linear span of A,. Then 
{ye X* | p(x) =O forall x e X,} C {pe X* | p*(W, 0) <1/n}. (11) 
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Since X,, is finite dimensional, it is closed and is a proper subspace of X since X is infinite 
dimensional. It follows from Corollary 11 of the preceding chapter that there is a nonzero 
functional y, € X* which vanishes on X,,. Define g, =n - Yn/ |p|]. Observe that |ly, || =n 
and, by (11), that p*(¢g,, 0) < 1/n. Therefore {g,} is an unbounded sequence in X* that 
converges pointwise to 0. This contradicts the Uniform Boundedness Theorem. Thus the 
weak-« topology on X* is not metrizable. O 


Theorem 10 Let X be anormed linear space and W a separable subspace of X* that separates 
points in X. Then the W-weak topology on the closed unit ball B of X is metrizable. 


Proof Since W is separable, B* N W also is separable, where B* is the closed unit ball of X*. 
Choose a countable dense subset {,}7°, of B* N W. Define p: BX B > R by 
<1 
plu, v) = Py ak lW.(u — v)| for allu, ve B. 

This is properly defined since each yy, belongs to B*. We first claim that p is a metric on B. 
The symmetry and triangle inequality are inherited by p from the linearity of y;,’s. On the 
other hand, since W separates points in X, any dense subset of S* M W also separates points 
in X. Therefore, for u, v € B with u 4 v, there is a natural number k for which y;,(u — v) #0 
and therefore p(u, v) > 0. Thus p is a metric on B. Observe that for each natural number n, 
since each ys; belongs to B*, 


k=1 k=1 


We leave it as an exercise to infer from these inequalities and the denseness of {Wy}, in 
B’ 1 W that {x « B{ p(x, 0) <1/n}_, is a base at the origin for the W-weak topology on B. 
Therefore the topology induced by the metric p is the W-weak topology on B. LO 


Corollary 11 Let X be a normed linear space. 


(i) The weak topology on the closed unit ball B of X is metrizable if X* is separable. 
(i) The weak-« topology on the closed unit ball B* of X* is metrizable if X is separable. 


Theorem 12 Let X be a reflexive Banach space. Then the weak topology on the closed unit 
ball B is metrizable if and only if X is separable. 


Proof Since X is reflexive, Theorem 14 of the preceding chapter tells us that if X is 
separable, so is X*. Therefore, by the preceding corollary, if X is separable, then the weak 
topology on B is metrizable. Conversely, suppose the weak topology on B is metrizable. Let 
p: BX B — {0, oo) be a metric that induces the weak topology on B. Let n be a natural 
number. We may choose a finite subset F,, of X* and €, > 0 for which 


{xe€B| |W(x)|<e for ally e F,}C {xe B| p(x, 0) <1/n}. 


Therefore 


() ker) NBC {xe B| p(x, 0) <1/n}. (13) 
WeFn 
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Define Z to be the closed linear span of U™, F,. Then Z is separable since finite linear 
combinations, with rational coefficients, of functionals inU°C, F, is accountable dense subset 
of Z. We claim that Z = X*. Otherwise, Corollary 11 of the preceding chapter tells us that 
there is a nonzero S§ € (X*)*, which vanishes of Z. Since X is reflexive, there is some xp € X 
for which S = J(x9). Thus xp #0 and yi (x0) = 0 for all k. According to (13), p(xo, 0) <1/n 
for all n. Hence xo #0 but p(xo, 0) = 0. This is a contradiction. Therefore X* is separable. 
Theorem 13 of the preceding chapter tells us that X also is separable. LJ 


PROBLEMS 
16. Show that the dual of an infinite dimensional normed linear space also is infinite dimensional. 


17. Complete the last step of the proof of Theorem 10 by showing that the inequalities (12) imply 
that the metric p induces the W-weak topology. | 


18. Let X be a Banach space, W a closed subspace of its dual X*, and yp belong to X*~W. Show 
that if either W is finite dimensional or X is reflexive, then there is a vector xg in X for which 
Wo(xo) #0 but &(xo) = 0 for all w € W. Exhibit an example of an infinite dimensional closed 
subspace W of X* for which this separation property fails. 


CHAPTER § 16 
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on Hilbert Spaces 
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The inner product (u, v) of two vectors u = (u;,..., Un) and v € (vj,..., ¥,) in Euclidean 
space R" is defined by 
n 
(u, v) = S ULV ;. 
k=l 
We call this the Euclidean inner product. The Euclidean norm || - || is determined by the 
relation 


|u|] = y(u, u) for allu € R”. 


With respect to the Euclidean inner product there is the important notion of orthogonality, 
which brings a geometric viewpoint to the study of problems in finite dimensional spaces: 
subspaces have orthogonal complements and the solvability of systems of equations can be 
determined by orthogonality relations. The inner product also brings to light interesting 
classes of linear operators that have quite special structure: prominent among these are 
the symmetric operators for which there is a beautiful eigenvector representation. In this 
chapter we study Banach spaces H that have an inner product that is related to the norm 
as it is in the Euclidean spaces. These spaces are called Hilbert spaces. We show that if V 
is a closed subspace of a Hilbert space H, then H is the direct sum of V and its orthogonal 
complement. Based on this structural property, we prove the Riesz-Fréchet Representation 
Theorem, which characterizes the dual space of a Hilbert space. From this we infer, using 
Helley’s Theorem, that every bounded sequence in a Hilbert space has a weakly convergent 
subsequence. We prove Bessel’s Inequality from which we infer that a countable orthonormal 
set is an orthonormal basis if and only if its linear span is dense. The chapter concludes with 
an examination of bounded symmetric operators and compact operators on a Hilbert space, 
in preparation for the proof of two theorems: the Hilbert-Schmidt Theorem regarding an 
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eigenvalue expansion for compact symmetric operators and the Riesz-Schauder Theorem 
regarding the Fredholm properties of compact perturbations of the identity operator. 


16.1 THE INNER PRODUCT AND ORTHOGONALITY 


Definition Let H be a linear space. A function (-, -): H X H > R is called an inner product 
on H provided for all x1, x2,x and y € H and real numbers a and B, 


(i) (ax, + Bx2, y) =a(x1, y) + B(x2, y), 
(it) (x, y) = ({y, x), 
(iii) (x, x) >Oifx#0. 


A linear space H together with an inner product on H is called an inner product space. 
Property (ii) is called symmetry. From (i) and (ii) it follows that (x, ay; + By?) = 
a(x, yi) + B(x, yz): this property, together with (1), is called bilinearity. 


Among infinite dimensional linear spaces two examples of inner product spaces come to 
mind. For two sequences x = {x,} and y = {yz} € £”, the é* inner product, (x, y), is defined by 


(o@) 
(x, y) = > Xk Vk- 
k=1 


For E a measurable set of real numbers and two functions f and g € L?(E), the L? inner 
product, (f, g), is defined by 


ina=| fee 


where the integral is with respect to Lebesgue measure. 
In Chapter 7 we obtained the Cauchy-Schwartz Inequality for L*( E) as a special case 
of Hélder’s Inequality. This inequality holds for any inner product space. 


The Cauchy-Schwarz Inequality For any two vectors u, v in an inner product space H, 
\(u, v)| < [lull - lull. 
To verify this, observe that 
0 < lu + roll? = lull? + 2t(u, v) +r? lull? for allt € R. 


The quadratic polynomial in t defined by the right-hand side fails to have distinct real roots 
and therefore its discriminant is not positive, that is, the Cauchy-Schwarz Inequality holds. 


Proposition 1 For a vector h in an inner product space H, define 
[Al] = yh, A). 


Then || - || is anorm on H called the norm induced by the inner product (-, -). 
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Proof The only property of a norm that is not evident for || - || is the triangle inequality. This, 

however, is a consequence of the Cauchy-Schwarz Inequality since, for two vectors u, v € H, 

lu + vl = (ut v, w+ v) = fall? +20, v) + [fol]? < ffl? + 2a oll + rol? = (ell + Hall). 
LJ 


The following identity characterizes norms that are induced by an inner product; see 
Problem 7. 


The Parallelogram Identity For any two vectors u, v in an inner product space H, 
lu — ull? + lle + vl? = Ql]? + 2p. 
To verify this identity just add the following two equalities: 
iu — ull? = lull? — 2(u, v) + [lols 
iu + ull? = [lull + 2(u, v) + [lvl?. 


Definition An inner product space H is called a Hilbert space provided it is a Banach space 
with respect to the norm induced by the inner product. 


The Riesz-Fischer Theorem tells us that for E a measurable set of real numbers, L?(E) 
is a Hilbert space and, as a consequence, so is 02. 


Proposition 2 Let K be anonempty closed convex subset of a Hilbert space H and ho belong 
to H ~ K. Then there is exactly one vector h, € K that is closest to ho in the sense that 


[40 — hy|| = dist(ho, K) = inf ||ho — hl. 
heK 


Proof By replacing K by K — ho, we may assume that ho = 0. Let {h,} be a sequence in K 
for which 


fm. [hall = int (lA. (1) 
We infer from the parallelogram identity and the convexity of K that for each m and n, 


2 2 2 


h,+h 
Wall? + lAmll? = 2 ~~ 


hn — hm hn —hn 
2 2 


2 2-1 h 2- 2 
+ > inf || | + (2) 


From (1) and (2) we infer that {h,} is a Cauchy sequence. Since H is complete and K is 
closed, {h,} converges strongly to h* € K. By the continuity of the norm, ||h*|| = inf heK ||All. 
This point in K that is closest to the origin is unique. Indeed, if h* is another vector in K that 
is closest to the origin, then, if we substitute h* for h, and h, for h», in inequality (2), we have 


O> |[h* ||? + hall? —2- inf Al] > 2- 
heK 


h* — h, 
2 


Thus h* = h,. L] 
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Definition Two vectors u, v in the inner product space H are said to be orthogonal provided 
(u, v) = 0. A vector u is said to be orthogonal to a subset S of H provided it is orthogonal to 
each vector in S. We denote by S+ the collection of vectors in H that are orthogonal to S. 


We leave it as an exercise to infer from the Cauchy-Schwarz Inequality that if S is 
a subset of an inner product space H, then S+ is a closed subspace of H. The following 
theorem is fundamental. : 


Theorem 3 Let V be a closed subspace of a Hilbert space H. Then H has the orthogonal 
direct sum decomposition | 
H=VOV". (3) 


| , ; 
Proof Let ho belong to H ~ V. The preceding proposition tells us there is a unique vector 
h* € V that is closest to ho. Let h be any vector in V. For a real number f, since V is a linear 
space, the vector h* — th belongs to V and therefore 


(to—h*, ho—h*) = l\o—h* |? < Ilho—(h*—th) ||? = (ho—h*, ho—h*)-+2t-(ho—h*, h)+22(h, h). 


Hence 
0 <2t- (ho —h*, h) +¢7(h, h) forallt € R, 


and therefore (ho — h*, h) = 0. Thus the vector ho — h* is orthogonal to V. Observe that 
ho = h* + [ho — h*]. We conclude that H = V+ V+ andsince VN V+ = {0}, H=>VOV-L.O 


We leave the proof of the following corollary as an exercise. 


Corollary 4 Let S be a subset of a Hilbert space H. Then the closed linear span of S is all of 
H if and only if S+ = {0}. 


In view of (3), for a closed subspace V of H, we call V+ the orthogonal complement of 
V in H and refer to (3) as an orthogonal decomposition of H. The operator P € £( H) that 
is the projection of H onto V along V+ is called the orthogonal projection of H onto V. 


Proposition 5 Let P be the orthogonal projection of a Hilbert space H onto a nontrivial 
closed subspace V of H. Then || P\| = 1 and | 


(Pu, v) = (u, Pv) forallu,ve H. (4) 
Proof Let the vector u belong to H. Then 
lu? = (P(u) + (Id—P)(u), P(v) + (Id—P)(v)) = | P(u)I? + (Id -—P)(u I? = HP (uIP, 


and hence || P(u)|| < ||u||. We therefore have ||P|| < 1 and conclude that ||P|| = 1 since 
P(v) = v for each nonzero vector in V. If the vector v also belongs to H, then 


(P(u), (Id-P)(v)) = (Id=P)(u), P(v)) = 


so that 
(P(u), v) = (P(u), P(v)) = (u, P(v)). U 
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The proofs of many results we established for. general Banach spaces are much simpler 
for the special case of Hilbert spaces; see Problems 11-15. 


Remark A Banach space X is said to be complemented provided every closed subspace 
of X has a closed linear complement. A Banach space X is said to be Hilbertable provided 
there is an inner product on X whose induced norm is equivalent to the given norm. We infer 
from Theorem 3 that a Hilbertable Banach space is complemented. A remarkable theorem of 
Joram Lindenstrauss and Lior Tzafriri asserts that the converse is true: If a Banach space is 
complemented, then it is Hilbertable. '| O 


PROBLEMS 
In the following problems, H is a Hilbert space. 


1. Let [a, b] be a closed, bounded interval of real numbers. Show that the L7[a, b] inner product 
is also an inner product on C[a, b]. Is C[a, b], considered as an inner product space with the 
L?[a, b] inner product, a Hilbert space? 


2. Show that the maximum norm on C{a, d] is not induced by an inner product and neither is 
the usual norm on £!. 


3. Let H, and HM be Hilbert spaces. Show that the Cartesian product H; X Hy also is a Hilbert 
space with an inner product with respect to which Hy X {0} = [{0} x Hp]+. 


4. Show that if S is a subset of an inner product space H, then S+ is a closed subspace of H. 
5. Let S be a subset of H. Show that S = (S1)+ if and only if S is a closed subspace of H. 


6. (Polarization Identity) Show that for any two vectors u, v € H, 
1 
(u,v) = Z[lhu + ull? — Ju — vf]. 


7. (Jordan-von Neumann) Let X be a linear space normed by || - ||. Use the polarization identity 
to show that || - || is induced by an inner product if and only if the parallelogram identity holds. 


8. Let V be aclosed subspace of H and P a projection of H onto V. Show that P is the orthogonal 
projection of H onto V if and only if (4) holds. 


9. Let T belong to £( H). Show that T is an isometry if and only if 
(T(u), T(v)) = (u, v) for all u, v € A. 


10. Let V be a finite dimensional subspace of H and g,...,@, a basis for V consisting of unit 
vectors, each pair of which is orthogonal. Show that the orthogonal projection P of H onto V 
is given by 


n 


P(h) = Sth, gx) gx for all h € V. 
k=1 


11. For h a vector in H, show that the function u +> (h, u) belongs to H™*. 


1CQn the complemented subspace problem,” Israel Journal of Math, 9, 1971. 
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12. For any vector h € H, show that there is a bounded linear functional  € H* for which 
lel] = 1 and #(h) = |All. 


13. Let V be a closed subspace of H and P the orthogonal projection of H onto V. For any 
normed linear space X and T € L(V, X), show that T o P belongs to £(H, X), and is an 
extension of T: V — X for which ||T o P|| = ||T. 


14. Prove the Hyperplane Separation Theorem for H, considered as a locally convex topological 
vector space with respect to the strong topology, by directly using Proposition 2. 


15. Use Proposition 2 to prove the Krein-Milman Lemma in a Hilbert space. 


16.2 THE DUAL SPACE AND WEAK SEQUENTIAL CONVERGENCE 


For E a measurable set of real numbers, 1 < p < oo, and q the conjugate to p, the Riesz 
Representation Theorem for L?(E) explicitly describes a linear isometry of L4(E) onto 
[L?(E)]*. The p = 2 case of this theorem extends to general Hilbert spaces. 


The Riesz-Frechet Representation Theorem Let H be a Hilbert space. Define the operator 
T: H -» H™ by assigning to each h € H the linear functional T(h): H — R defined by 


T(h)[u] = (h, u) forallu € H. (5) 
Then T is a linear isometry of H onto H*. 


Proof Let h belong to H. Then T(h) is linear. We infer from the Cauchy-Schwarz Inequality 
that the functional T(h): H - R is bounded and ||T(h)|| < ||Al]. But if h #0, then 
T(h)[h/lAll] = All. Therefore ||T(h)|| = ||hl]. Thus T is an isometry. It is clear that T is 
linear. It remains to show that T(H) = H*. Let Wo #0 belong to H*. Since Wo is continuous, 
its kernel is a closed proper subspace of H. By Theorem 3, since ker ys # H, we may choose 
a unit vector h, € H that is orthogonal to ker. Define ho = Wo(hs)h,. We claim that 
T(ho) = Wo. Indeed, for h € H, 


Wo(h) Wo(h) _ 
h- Wo(ha) € kero, so that (h — o(ha) h,) =0 
and therefore fjo(h) = (ho, h) = T(ho)[h]. O 


As in the case of a general Banach space, for a sequence in a Hilbert space H, we write 
{un} —~ u in H to mean that the sequence {u,} is a sequence in H that converges weakly to 
u € H. In view of the Riesz-Fréchet Representation Theorem, 


{un} — uv in A if and only if lim (h, u,) = (h, u) forallh € H. 
n 


Theorem 6 Every bounded sequence in a Hilbert space H has a weakly convergent 
subsequence. 


Proof Let {h,,} be a bounded sequence in H. Define Hp to be the closed linear span of {hj}. 
Then Hp is separable. For each natural number n, define w,, € [Ho]* by 


Wn(h) = (hy, h) for all h € Ap. 
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Since {h,} is bounded, we infer from the Cauchy-Schwarz Inequality that {W,} also is 
bounded. Then {y,,} is a bounded sequence of bounded linear functionals on the separable 
normed linear space Hp. Helley’s Theorem tells us that there is a subsequence {w/,,} of {Wn} 
that converges pointwise to #% € [Hp]*. According to the Riesz-Fréchet Representation 
Theorem, there is a vector hg € Hp for which % = T(ho). Thus 


lim (hyn,, h) = (ho, h) for all h € A. 
k—0o 
Let P be the orthogonal projection on H onto Ap. For each index k, since (Id—P)[H] = 
P(H)*, 
(hn,, (Id—P)[h]) = (ho, (Id—P)[h]) = 0 for all h € H. 


Therefore 
lim (An,, h) = (ho, h) for allh € H. 
k-0o 
Thus {h,,} converges weakly to ho in H. CO 


We gather in the following proposition some properties regarding weakly convergent 
sequences which we established earlier for general Banach spaces but which, because of the 
Riesz-Fréchet Representation Theorem, have much simpler proofs in the case of Hilbert 
spaces (see Problem 17). 


Proposition 7 Let {u,} — u weakly in the Hilbert space H. Then {u,} is bounded and 
[|@|] < lim inf ||up|I. 
Moreover, if {v,} — v strongly in H, then 


Jim (un, Un) = (u, v). (6) 


The following two propositions describe properties of weakly convergent sequences 
in a Hilbert space, which in Chapter 8 we already noted hold in the L?(£) spaces, for E a 
measurable set of real numbers and 1 < p < ov, but do not hold in general Banach spaces. 


The Banach-Saks Theorem Let {u,,} — u weakly in the Hilbert space H. Then there is a 
subsequence {un,} of {un} for which 

kim 2! tet bun, 
k— 00 


=u strongly in H. (7) 


Proof Replacing each u,, with u, — u we may suppose that u = 0. Since a weakly convergent 
sequence is bounded, we may choose M > 0 such that 


lun |2 < M for all n. 


We will inductively choose a subsequence {u,,,} of {u,,} with the property that for all k, 


un, tee tun, | < (24+ M)k. (8) 
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For such a sequence, 
. Un, +... + Un, 
k 


2 
— (2+) 
— ok 


for all k | (9) 


and the proof is complete. 


Define n; = 1. Since {u,} — 0 and u,, belongs to H, we can choose an index n> such 
that |(un,;, Un))| <1. Suppose we have chosen natural numbers n, <n <--- <n, such that 


lun, +++ +Un lI? < (2+M)jfor j=1,...,k 
Since {u,} — 0 and u,, +---+ un, belongs to H, we may choose n;41 > nx such that 
[(uny + +++ + Ung, Unyy)| <1. 
However, 
ey Po bag Hing? = Uttny He Fag? + (ny Hee A gs egg) + lene 
Therefore, | | 
ny te tng | < (24 M)k+2+M = (2+ M)(k+1). 
We have chosen a subsequence so that (8) holds. u 


The Radon-Riesz Theorem Let {u,,} > u weakly in the Hilbert space H. Then 
{un} — u Strongly in H if and only if lim lun || = |lull. 
n— 


Proof Since the norm is continuous on H, with respect to the strong topology, if {u,} > u 
strongly, then limy-, 09 un || = |u|]. On the other hand, if lim,_,o9 ||un|| = ||u/|, then since 


[lun — ull? = |lunll” — 2(un, u) + [lull for all n, 


the weakly convergent sequence {u,} is strongly convergent. | LJ 


Theorem 8 Let H be a Hilbert space. Then H is reflexive. Therefore every nonempty strongly 
closed bounded convex subset K of H is weakly compact and hence is the strongly closed 
convex hull of its extreme points. 


Proof To establish reflexivity it is necessary to show that the natural embedding J: H > 
[H*]* is onto. Let WV: H* -> R be a bounded linear functional. Let T: H > [H]* be the | 
isomorphism described by the Riesz-Fréchet Representation Theorem. Then Vo T: H > 
R, being the composition of bounded linear operators, is bounded. The Riesz-Fréchet 
Representation Theorem tells us that there is a vector ho € H for which V o T = T(ho). 

Therefore : 


¥(T(h)) = T(ho [A] = T(h)[ho] = J(ho )[T(h)] for all h € H. 


Since T(H) = H*, V = J(ho). Thus H is reflexive. We infer from Kakutani’s Theorem 
and Mazur’s Theorem that every strongly closed bounded convex subset K of H is weakly 
compact. Therefore, by the Krein-Milman Theorem and another application of Mazur’s — 
Theorem, such a set K is the strongly closed convex hull of its extreme points. L 


316 Chapter16 Continuous Linear Operators on Hilbert Spaces 


PROBLEMS 
16. Show that neither £', 2, L'[a, b] nor L™[a, b] is Hilbertable. 
17. Prove Proposition 7. 


18. Let H be an inner product space. Show that since H is a dense subset of a Banach space X 
whose norm restricts to the norm induced by the inner product on H, the inner product on 
H extends to X and induces the norm on X. Thus inner product spaces have Hilbert space 
completions. 


16.3 BESSEL’S INEQUALITY AND ORTHONORMAL BASES 


Throughout this section H is an inner product space. 


Definition A subset S of H is said to be orthogonal provided every two vectors in S are 
orthogonal. If such a set has the further property that each vector in S is a unit vector, then S 
is said to be orthonormal. 


The General Pythagorean Identity [f u;,u2,...,u, are n orthonormal vectors in H, and 
@,...,Q, are real numbers, then 


lau, +--+ +aquy ||? = llayll? +--+ + lanl. 
This identity follows from an expansion of the right-hand side of the following identity 
lors Hes + gull” = (oy +--+ + Oly, OU + +++ + Opty). 


Bessel’s Inequality For {¢,} an orthonormal sequence in H and h a vector in H, 


oO 


S) (g%, hy? < |All’. 
k=1 


To verify this inequality, fix a natural number n and define hy, = S7_, (gx, h)gx. Then, by 
the General Pythagorean Identity, 


O<|h—hyll? = hl]? —2(h, An) + Wall? 
= lhl? -2D\(h, gx)th, oe) + > (h, gx)? 
k=l k=1 
= [Al -— > (h, gx)’. 
k=1 


Therefore 
n 


S) (g%, hy? < |All’. 
k=1 


Take the limit as n > o¢ to obtain Bessel’s Inequality. 
Proposition 9 Let {g;,} be an orthonormal sequence in a Hilbert space H and the vector 


h belong to H. Then the series S7°,(¢x, h)px converges strongly in H and the vector 
h — Se1 (Gk, h)px is orthogonal to each oy. 
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Proof For a natural number n, define h, = D7_, (px, h)~x. By the General Pythagorean 
Identity, for each pair of natural numbers n and k, 


n+k 
Winsk — Fall’ = > (i, hy. 
i=n+1 


However, by Bessel’s Inequality, the series }1°°, (gi, h)* converges and hence {h,} is a 
Cauchy sequence in H. Since H is complete, S72, (¢x, 4)@x converges strongly to a vector 
h, € H. Fix a natural number m. Observe that if n > m, then h — h, is orthogonal to g,. By 
the continuity of the inner product, h — h, is orthogonal to @. 


Definition An orthonormal sequence {;} in a Hilbert space H is said to be complete 
provided the only vector h € H that is orthogonal to every gy, ish = 0. 


We infer from Corollary 4 that an orthonormal sequence {g;} in a Hilbert space H is 
complete if and only if the closed linear span of {g ;} is H. 


Definition An orthonormal sequence {g,} in a Hilbert space H is called an orthonormal 
basis for H provided 


00 
h= > (ek, h)g, for allh € H. (10) 
k=1 


Proposition 10 An orthonormal sequence {yx} in a Hilbert space H is complete if and only if 
it is an orthonormal basis. 


Proof First assume {¢;} is complete. According to the preceding proposition, hse (Gk, h) py 
is orthogonal to each g;. Therefore, by the completeness of {¢;}, (10) holds. Conversely, 
suppose (10) holds. Then if h € H is orthogonal to all g;, then 


o.@) o.@) 
h= > (ox, h)ex = >) 0-o% =0. 
k=1 k=1 ‘a 


Example The countable collection of functions in L[0, 27] consisting of the constant func- 
tion that takes the value 1/4/27 and the functions (1/./27-sin kt, 1/./27-cos kt}, are acom- 
plete orthonormal sequence for the Hilbert space L7[0, 27]. Indeed, we infer from the ele- 
mentrary trigonometric identities that this sequence is orthonormal. We infer from the 
Stone-Weierstrass Theorem that the linear span of this sequence is dense, with respect to the 
maximum norm, in the Banach space C[a, b]. Thus, by the density of C[a, b] in L7[0, 277], 
the linear span of this sequence is dense in L7[0, 27]. 


If a Hilbert space H possesses an orthonormal basis {;}, then, since finite rational 
linear combinations of the g,’s are a countable dense subset of H, H must be separable. 
It turns out that separability is also a sufficient condition for a Hilbert space to possess an 
orthonormal basis. 


Theorem 11 Every infinite dimensional separable Hilbert space posesses an orthonormal 
basis. 
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Proof Let F be the collection of subsets of H that are orthonormal. Order F by inclusion. 
For évery linearly ordered subcollection of F, the union of the sets in the subcollection 
is an upper bound for the subcollection. By Zorn’s Lemma, we may select a maximal 
subset So of F. Since H is separable, So is countable. Let {g,}7°2, be an enumeration of 
So. lf h € H,h#0, then, by Proposition 9, h — S7°, (gx, h)px is orthogonal to each gy. 
Therefore h — °°, (gx, h)gx = 0, for otherwise the union of So and the normalization of 
h- YP (Gk, bh), would be an orthonormal set that properly contains So. Therefore {g,}7°, 
is an orthonormal basis for H. C] 


PROBLEMS 
19. Show that an orthonormal subset of a separable Hilbert space H must be countable. 


20. Let {g,} be an orthonormal sequence in a Hilbert space H. Show that {g;} converges weakly 
to 0in H. 


21. Let {¢;,} be an orthonormal basis for the separable Hilbert space H. Show that {u,} —uin H 
if and only if for each k, limy+o9(Un, Ge) = (U, Gx). 


22. Show that any two infinite dimensional separable Hilbert spaces are isometrically isomorphic 
and that any such isomorphism preserves the inner product. 


23. Let H be a Hilbert space and V a closed separable subspace of H for which {gx} is an 
orthonormal basis. Show that the orthogonal projection of H onto V, P, is given by 


[o,¢) 
P(h) = >. (ox, h)g for allh € H. 
k=1 


24. (Parseval’s Identities) Let {y;,} be an orthonormal basis for a Hilbert space H. Verify that 


oO 
Whi? = S (gx, h)? for all h € H. 
k=1 


Also verify that 


(u, v) = Sax «by for alll u, v € H, 
k=1 
where, for each natural number k, a, = (u, o,) and by = (v, ox). 
25. Verify the assertions in the example of the orthonormal basis for L7[0, 27]. 
26. Use Proposition 10 and the Stone-Weierstrass Theorem to show that for each f € L?[—7, a], 


ie, @) 
f(x) =a9/2+ > [a, - coskx + by - sin kx], 
k=1 


where the convergence is in L*[—7, 7] and each 


1/7 1 {7” 
ay = -| f(x) coskx dx and by, = -| f(x) sinkx dx 
T Jaa —7 
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16.4 ADJOINTS AND SYMMETRY FOR LINEAR OPERATORS 


Throughout this section, H denotes a Hilbert space. We denote £(H, H) by £(H). Let T 
belong to £( #). For a fixed vector v in H, the mapping 


ut>(T(u), v) foru € H, 


belongs to H* since it is linear and, by the Cauchy-Schwarz Inequality, |(T(u), v)| <c- |Jull 
for allu € H, where c = ||T|| - ||v||. According to the Riesz-Fréchet Representation Theorem, 
there is a unique vector h € H such that (T(u), v) = (h, u) = (u, h) for all u € H. We 
denote this vector h by T*(v). This defines a mapping 7*: H > H that is determined by the 
relation 

(T(u), v) = lw, T*(v)) for allu, v € H. (11) 


We call 7* the adjoint of T. 
Proposition 12 Let H be a Hilbert space. If T belongs to £L( H), so does T* and \|T|| = ||T* ||. 


Proof Clearly 7* is linear. Let h be a unit vector in H. Then, by the Cauchy-Schwarz 
Inequality, | 


\T*(h)|P = (F*(h), T*(H)) = (TCT), hy < ATH 
Thus T* belongs to £( #) and ||7*|| < ||T']. But also observe that 
(TAYE = (T(h), T(h)) = (T(T(A)), h) < IT*IITCADL 
Therefore ||T|| < ||7*}). LJ 


We leave it as an exercise to verify the following structural properties of adjoints: for 
T,SEL(H), 


(T*)"=T, (T+S)* =7* + Sand (ToS)* = S* oT™. (12) 


Proposition 13 Let H be a Hilbert space. Suppose T belongs to L(H) and has a closed image. 
Then —— 
ImT @ ker T* = H. | (13) 


Proof Since Im T is closed, it suffices, by Theorem 3, to show that ker 7* = [ImT]+. But 
this is an immediate consequence of the relation (11). L 


Proposition 14 Let H be a Hilbert space. Suppose T belongs to L( H) and there is ac >0 for 
which? 


(T(h), h) > ||h|* forall h € H. (14) 
Then T is invertible. 


2 An operator T for which (14) holds is called positive definite. 
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Proof The inequality (14) implies that ker T = {0}. We infer from (14) and the Cauchy- 
Schwarz Inequality that 


|T7(u) — T(v)|| = cllu — v|] for all u, v € H. 


We claim that T has closed range. Indeed, let {T(h,)} > h strongly in H. Then {T(h, )} is 
Cauchy. The above inequality implies that {h,} is Cauchy. But H is complete and therefore 
there is a vector h, to which {h,} converges strongly. By the continuity of T, T(h,) = h. 
Therefore T(#) is closed. We also claim that ker T* = {0}. Indeed, by (14), the symmetry of 
the inner product and the definition of the adjoint, 


(T*(h), h) = (T(h), h) > ||hl/? for all hh € H. 
Therefore ker T* = {0}. We infer from the preceding proposition that T( H) = H. L 


An examination of the proof of the Riesz-Fréchet Representation Theorem reveals 
that the symmetry of the inner product was not used. The following important generalization 
of this theorem has many applications in the study of partial differential equations. 


Theorem 15 (the Lax-Milgram Lemma) Let H be a Hilbert space. Suppose the function 
B: HXH —> R has the following three properties: 


(i) For eachu € H, the following two functionals are linear on H; 
v+> B(u, v) and v'> B(v, u). 
(ii) There is acy > 0 for which 
|B(u, v)| <cy- lull - [lvl] for all u,v € H. 
(iti) There is a cz > 0 for which 
B(h, h) > cp: \\h|\? forallh € H. 
Then for each fb € H%, there is a unique h € H for which 
w(u) = B(h, u) forallu € H. 


Proof Let T: H — H* be the isomorphism defined by the Riesz-Fréchet Representation 
Theorem, that is, for each h € H, 


T(h)[u] = (h, u) for allu € H. (15) 
For each h € H, define the functional S(h): H > R by 
S(h)[u] = B(h, u) for allu € H. (16) 


We infer from assumptions (i) and (ii) that each S$(h) is a bounded linear functional on H 
and that the operator S: H — H* is linear and continuous. Since T is an isomorphism of H 
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onto H*, to show that S is an isomorphism of H onto H™* is equivalent to showing that the 
operator T~! o S € £L(H) is invertible. However, by assumption (iii), 


((T~!oS)(h), h) = S(h)[h] = B(h, h) > co - ||All’ for all h € H. 
The preceding proposition tells us that T~! o S € £L(H) is invertible. a 


Definition An operator T € £L(H) is said to be symmetric or self-adjoint provided T = T*, 
that is, 


(T(u), v) = (u, T(v)) forallu,v eH. 


Example Let {gy} be an orthonormal basis for the separable Hilbert space H and T belong 
to £(H). Then, by the continuity of the inner product, T is symmetric if and only if 


(T(9;), ¢;) = (T(¢;), @) for alll <i, j <a. 


In particular, if H is Euclidean space R", then T is symmetric if and only if the n Xn matrix 
that represents T with respect to an orthonormal basis is a symmetric matrix. 


A symmetric operator T € £(H) is said to be nonnegative, written T > 0, provided 
(T(h), h) => 0 for all h € H. Moreover, for two symmetric operators A, B € £(H), we write 
A > B provided A — B > 0. The sum of nonnegative, symmetric operators is nonnegative 
and symmetric. Moreover, 


if T € £( H) is symmetric and nonnegative, then sois S*7SforanyS¢eL(H), (17) 


since for eachh € H, (S*TS(h), h) = (T(S(h)), S(h)) = 0. In Problems 37-43 we explore a 
few of the many interesting consequences of this order relation among symmetric operators. 


The Polarization Identity For a symmetric operator T € L( HF), 


(T(u), v) = ; [(T(u+v),u+v)—(T(u—v),u—v)] forallu,ve H. (18) 


To verify this identity, simply expand the two inner products on the right-hand side. If we 
associate with a symmetric operator T € £( H) the quadratic form Q7: H — R defined by 


Qr(u) = (T(u), u) for allu € H, 


the Polarization Identity tells us that T is completely determined by Q7. In particular, T = 0 
on H if and only if Q@7 = 0 on H. In fact, the following much sharper result holds. It is 
useful, for T € £(H) and A € R, to denote AIld—T by A — T, where the identity mapping 
Id: H > His defined by Id(h) =A for allh € ZH. 


Proposition 16 Let H be a Hilbert space and the operator T € L(H) be symmetric. Then 


ITH = sup (T(x), wh (19) 
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Proof Denote sup j,)~1|(T(u), u) by n. If 7 = 0, we infer from the Polarization Identity 
that T = 0. So consider the case 7 > 0. Observe that, by the Cauchy-Schwarz Inequality, for 
a unit vector u € H, 

\(T(u), u)| < WT (w)ihllull < ITI. 


Thus, 4 < ||T||. To prove the inequality in the opposite direction, observe that the two 
symmetric operators y — T and 1+ T are nonnegative and therefore, by (17), the two 
operators 


(n+T)"(n-T)(n+T) =(n+T)(n-T)(n+T) 
and 
(n—T)"(n+T)(n-T)=(n-T)(n+T)(q-T) 


also are nonnegative and hence so is their sum 2n(7° — T*). Since 2n > 0, 7° — T? is 
nonnegative, that is, 


IT (u)IP = (T(u), T(u)) = (T?(u), u) < 7 (u, u) = 1? lull? for alll ue H. 
Hence ||T|| < 7. LJ 


A general strategy in the study of a linear operator T € £(H) is to express H as a 
direct sum H; © H) for which T(H,) C Hy and T(H)) C H>. When this occurs we say the 
decomposition H = H; ® H) reduces the operator T. In general, if T( H, ) C Hy we cannot 
infer that T( H)) C Hp. However, for symmetric operators on H and an orthogonal direct 
sum decomposition of H, we have the following simple but very useful result. 


Proposition 17 Let H be a Hilbert space. Suppose the operator T € L(H) is symmetric and 
V is a subspace of H for which T(V) C V. Then T(V+) CV+. 


Proof Let u belong to V+. Then for any v € V, (T(u), v) = (u, T(v)) and (u, T(v)) =0 
since T(V) C V and u € V+. Thus T(u) € V+. O 


PROBLEMS 
27. Verify (12). 
28. Let T and S belong to £(H) and be symmetric. Show that T = S if and only if Or = Qs. 


29. Show the symmetric operators are a closed subspace of £(H). Also show that if T and S are 
symmetric, then so is the composition S o T if and only if T commutes with S with respect to 
composition, that is, SoT=ToS. 


30. (Hellinger-Toplitz) Let H be a Hilbert space and the linear operator T: H > H have the 
property that the (T(u), v) = (u, T(v)) for all u, v € h. Show that T belongs to £(H). 


31. Exhibit an operator T € £(R*) for which ||T|| > sup y,y-11(7(u), u)|.- 
32. Let S and T in £( H) be symmetric. Assume S > T and T > S. Prove that T = S. 


33. Let V be a closed nontrivial subspace of a Hilbert space H and P the orthogonal projection 
of H onto V. Show that P = P*, P > 0, and ||P|| = 1. 


34. Let P € £(H) be a projection. Show that P is the orthogonal projection of H onto P(H) if 
and only if P = P*. 


35. 


36. 
37. 


38. 


39. 


40. 


41. 


42. 
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Let {¢} be an orthonormal basis for a Hilbert space H and for each natural number n, define 
P,, to be the orthogonal projection of H onto the linear span of {¢1,..., Gn}. Show that P,, is 
symmetric and | 


0 < Py < Pai < Id for all n. 
Show that {P,,} converges pointwise on H to Id but does not converge unformly on the unit ball. 
Show that if T € £( H) is invertible, so is T* o T and therefore so is T*. 


(a General Cauchy-Schwarz Inequality) Let T ¢ £( H) be symmetric and nonnegative. Show 
that for all u, v € H, 


K(T(u), v)? < (T(u), u)-(T(v), v). 


Use the preceding problem to show that if S, 7 € £(H) are symmetric and S > T, then for 
each u € H, 


|S(u)-T(u)|* = ((S-T)(u), (S—T)(u))’ <|((S-T)(w), u)IK(S—T)*(u), (S—T)(u))| 


and thereby conclude that 


‘WS(u) - T(u)|I* < |(S(u), u) — (T(w), u)|- JS — TIP - ful? 


(a Monotone Convergence Theorem for Symmetric Operators) A sequence {T,,} of symmetric 
operators in £( 1) is said to be monotone increasing provided T,41 > T, for each n, and said to 
be bounded above provided there is a symmetric operator S in £( H) such that 7, < Sforalln. 


(i) Use the preceding problem to show that a monotone increasing sequence {Tn} of sym- 
metric operators in £( H) converges pointwise to a symmetric operator in £(H) if and 
only if it is bounded above. 


(ii) Show that a monotone increasing sequence {T,} of symmetric operators in L(H) is 
bounded above if and only if it is pointwise bounded, that is, for each h € H, the 
sequence {T,,(h)} is bounded. 


Let S € £(H) be a symmetric operator for which 0 < S < Id. Define a sequence {T} in 
L( H) by letting T; = 1/2(Id —S) and if n is a natural number for which T, € £(H) has been 
defined, defining 7,41 = 1/2(Id—S + T7). 


(i) Show that for each natural number n, T,, and 7,4; — T, are polynomials in Id—S with 


nonnegative coefficients. 


(ii) Show that {7,,} is a monotone increasing sequence of symmetric operators that is bounded 
above by Id. 


(iii) Use the preceding problem to show that {T,} converses pointwise to a symmetric 
operator T for which 0 < T < Id and T = 1/2(Id—S + T*) 


(iv) Define A = (Id—T). Show that A? = S. 


(Square Roots of Nonnegative Symmetric Operators) Let T € L(H ) be a nonnegative 
symmetric operator. A nonnegative symmetric operator A € £(H#) is called a square root of 
T provided A* = T. Use the inductive construction in the preceding problem to show that 
T has a square root A which commutes with each operator in C( H) that commutes with T. 

Show that the square root is unique: it is denoted by JT. Finally, show that T is invertible if 
and only if JT is invertible. 


An invertible operator T € £(H) is said to be orthogonal provided T~! = T*. Show that an 
invertible operator is orthogonal if and only if it is an isometry. 
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43. (Polar Decompositions) Let T € £(H) be invertible. Show that there is an orthogonal 
invertible operator A € £(H) and a nonnegative symmetric invertible operator B € £(H) 
such that T = Bo A. (Hint: Show that T*T is invertible and symmetric and let B = JT 0 T*.) 


16.5 COMPACT OPERATORS 


Definition An operator T € L( H) is said to be compact provided T(B) has compact closure, 
with respect to the strong topology, where B is the closed unit ball in H. 


Any operator T € £( H) maps bounded sets to bounded sets. An operator T € £(H) 
is said to be of finite rank provided its image is finite dimensional. Since a bounded subset of 
a finite dimensional space has compact closure, every operator of finite rank is compact. In 
particular, if H is finite dimensional, then every operator in £( H) is compact. On the other 
hand, according to Riesz’s Theorem, or by Theorem 11, the identity operator Id: H > H 
fails to be compact if H is infinite dimensional. For the same reason, an invertible operator 
in £(H) fails to be compact if H is infinite dimensional. 

In any metric space, compactness of a set is the same as sequential compactness. 
Furthermore, since a metric space is compact if and only if it is complete and totally 
bounded, a subset of a complete metric space has compact closure if and only if it is totally 
bounded. We therefore have the following useful characterizations of compactness for 
a bounded linear operator. 


Proposition 18 Let H be a Hilbert space and K belong to £L(H). Then the following are 
equivalent: 


(i) K is compact; 
(ii) K(B) is totally bounded, where B is the closed unit ball in H; 


(iii) If {hy} is a bounded sequence in H, then {K(h,)} has a strongly convergent subse- 
quence. 


Example Let {¢;,} be an orthonormal basis for the separable Hilbert space H and {A,} a 
sequence of real numbers that converges to 0. Define 


ie, @) 
T(h) = >) Acth, G)px for h € H. 
k=1 
We infer from Bessel’s Inequality and the boundedness of {A;} that T belongs to £(H) 
and we claim that T is compact. According to the preceding proposition, to show that K 
is compact it suffices to show that 7(B) is totally bounded. Let ¢ > 0. Choose N such that 
|Ax| < €/2 for k > N. Define Ty € £(H) by 


N 
Ty(h) = >) Acth, e)px for h € H. 
k=1 
We infer from Bessel’s Inequality that ||T(“) — Tn(h)|| < €/2||h|| for h € H. But Ty(B) is 
a bounded subset of a finite dimensional space, so it is totally bounded. Let « > 0. There 
is a finite €/2-net for Ty(B) and by doubling the radius of each of the balls in this net we get 
a finite «-net for T(B). 
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A linear operator T: H — H belongs to £( 7) if and only if it maps weakly convergent 
sequences to weakly convergent sequences (see Problem 47). 


Proposition 19 Let H be a Hilbert space. Then an operator T in £L( H) is compact if and only 
if it maps weakly convergent sequences to strongly convergent sequences, that is, 


if {tn} ~h, then {T(h,)} > T(h). 


Proof According to the preceding proposition, an operator is compact if and only if it 
maps bounded sequences to sequences that have a strongly convergent subsequence. First 
assume that T is compact. Observe that for any operator T € C(H), if {ux} — u, then 
{T(un)} —~ T(w), since for each v € H, 


Jim (T(ux), v) = lim { (uz, T*(v)) = (u, T*(v)) = (T(w), v). 


Let {h,} — h in H. By the compactness of 7, every subsequence of {T(h,,)} has a further 
subsequence that converges strongly and, by the preceding observation, its strong limit 
must be 7(h). Therefore the entire sequence {7(h, )} converges strongly to T(h). To 
prove the converse, assume 7 maps weakly convergent sequences to strongly convergent 
subsequences. Let {h,} be a bounded sequence. Theorem 6 tells us that {h,,} has a weakly 
convergent subsequence. The image of this weakly convergent subsequence converges 
strongly. L 


Schauder’s Theorem A compact linear operator on a Hilbert space has a compact adjoint. 


Proof Let K € £(H) be compact. According to the preceding proposition, it suffices to show 
that K* maps weakly convergent sequences to strongly convergent sequences. Let {h,} — h 
in H. For each n, 


|K*(hn) — K*(h)|I° = (KK*(In) — KK*(h), fy — hi). (20) 


Since K* is continuous, {K*(h, )} converges weakly to K*(h). The preceding proposition 
tells us that {K K*(h, )} > KK*(h) strongly in H. Therefore, by Proposition 7, 


lim (KK*(h,) — KK*(h), hy, —h) = 0. 
ko 
We infer from (20) that {K*(h, )} converges strongly to K*(h). L 


PROBLEMS 
44. Show that if H is infinite dimensional and T € £( #7) is invertible, then T 1s not compact. 
45. Prove Proposition 18. 


46. Let K( 1) denote the set of compact operators in £( H). Show that K(#) is a linear subspace 
of £( 1). Moreover, show that for K € K(H) and T € £(H), both K o T and T o K belong to 
K(A). 


47. Show that a linear operator 7: H — H is continuous if and only if it maps weakly convergent 
sequences to weakly convergent sequences. 
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48. Show that K € £(#) is compact if and only if whenever {u,} — u in H and {v,} — vin H, 
then (K(un), vn) > (K(u), v). 


49. Let {P,} be a sequence of orthogonal projections in £(H) with the property that for natural 
numbers n and m, P,(H) and P,,(H) are orthogonal finite dimensional subspaces of H. Let 
{A,} be a bounded sequence of real numbers. Show that 


is a properly defined symmetric operator in £( H ) that is compact if and only if {A,} converges 
to 0. 


50. For X a Banach space, define an operator T € £(X) be be compact provided 7(B) has 
compact closure. Show that Proposition 18 holds for a general Banach space and Proposition 
19 holds for a reflexive Banach space. 


16.6 THE HILBERT-SCHMIDT THEOREM 


A nonzero vector u € H is said to be an eigenvector of the operator T € £L(H) provided 
there is some A € R for which T(u) = Au. We call A the eigenvalue of T associated with the 
eigenvector u. One of the centerpieces of linear algebra is the following assertion: If H is a 
finite dimensional Hilbert space and T € £(H) is symmetric, then there is an orthonormal 
basis for H consisting of eigenvectors of T, that is, if H has dimension n, there is an 
orthonormal basis {¢1,...,@,} for H and numbers {A,,..., An} such that T(x) = Agg, for 
1<k <n. Thus 


T(h) = >) Aclh, py)g, for allh € H. (21) 

k=1 
Of course, in the absence of symmetry, a bounded linear operator, even on a finite 
dimensional space, may fail to have any eigenvectors. As the following example shows, 


even a symmetric operator on an infinite dimensional Hilbert space may fail to have any 
eigenvectors. 


Example Define T € £(L*[a, b]) by [T(f)](x) = xf(x) for f € L?[a, b]. For u,v € 
L7[a, bl, 


b 
(T(u), ») =f au(x)o(x)dx= tu, T(0)) 
Thus T is symmetric and one easily checks that it has no eigenvectors. 


We associated with a symmetric operator T € £(H), the quadratic form 07: H > R 
defined by 


Qr(h) = (T(h), h) for allh € H. 
It is useful to define the Raleigh quotient for 7, Rr: H ~ {0} > R, by 
(T(h), h) 
(h, h) 


Observe that a maximizer h, for the quadratic form Q7 on the unit sphere § = {h € H||h|| = 
1} is a maximizer for the Raleigh quotient Rr on H ~ {0}. 


Rr(h) = for all h € H ~ {0}. 


Section 16.6 The Hilbert-Schmidt Theorem 327 


The Hilbert-Schmidt Lemma Let H be a Hilbert space and T € £L(H) be compact and 
symmetric. Then T has an eigenvalue A for which | 


|A] = ||7l| = sup |(T(h), A)]. | (22) 
Il|=1 


Proof If T = 0 on H, then every nonzero vector in H is an eigenvector of T with 
corresponding eigenvalue A = 0. So consider the case T #0. Proposition 16 tells us that 


Tl = sup |(7(h), A)I. 
[I =1 


By possibly replacing T by —T we may suppose that ||7'|| = sup inpai(T(h), h). Denote 
sup yayai(T(h), h) by n. Let S = {h € H|||h|| = 1} be the unit sphere in H. 


Let {h,} be asequence of unit vectors for which limy_,o9(T (hx), hg) = n. By Theorem 6, 
by possibly passing to a subsequence, we may suppose that {h;,} converges weakly to hx. 
We have ||h,|| < lim inf ||h,|| = 1. According to Proposition 19, since T is compact, {T (hp )} 
converges strongly to T(h, ). Therefore, by Proposition 7, 


lim (T(h), hy) = (T(he), he). 
- ko 
Thus 7 = (T(hs), hx). Now hy #0 since 7 #0. Moreover, h, must be a unit vector. Indeed, 
otherwise 0 < ||h,|| < 1, in which case the quadratic form Q7 takes a value greater than 7 
at h,/||hx|| € S, contradicting the choice of 7 as being an upper bound for Q7 on S. Thus 
h, € Sand Qr(h) < Qr(h,x) for all h € S. Therefore, for the Raleigh Quotient for T, Rr, 
we have | | 
Rr(h) < Rr(h, ) for allh e H~ {O}. 


Let ho be any vector in H. Observe that the function f: R — R, defined by f(t) = 
Rr(h, + tho) for t € R, has a maximum value at t = 0 and therefore f’(0) = 0. A direct 
calculation gives 


(T(ho), Rx) + (T(x), ho) (he, ho) + (ho, hs) 


0= f'(0) = — (T (ha), he) 


[ns ||? al 
But T is symmetric, h, is a unit vector, and n = (T(h,), hx) so that 
Since this holds for all hg € H, T(hy) = nhy. | O 


The Hilbert-Schmidt Theorem Let H be a Hilbert space. Suppose K € £(H) is a compact 
symmetric operator that is not of finite rank. Then there is an orthonormal basis {i} for 
[ker K]* together with a sequence of real nonzero numbers {Ax} such that limy_,o0 Ax = 0 and 
K (dy) = Age for each k. Thus 


 K(h)= s Agth, We)We for all h E H. (23) 
k=1 
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Proof Let S be the unit sphere in H. According to the Hilbert-Schmidt Lemma, we may 
choose a vector y € S and p, € R for which 


K(1) = pay, and || = sup [(K(h), h)}. 


Since K #0, we infer from Proposition 16 that 4; #0. Define H, = [span {y,}]1. Since 
K(span {y}) © span {ys}, it follows from Proposition 17 that K( H; ) C Hj. Thus if we define 
K; to be the restriction of K to H;, then K, € £( Hj) is compact and symmetric. We again 
apply the Hilbert-Schmidt Lemma to choose a vector ys € SM Hy and pz € R for which 


K (2) = wat and |u2| = sup {|(K(h), h)|| he SOM}. 


Observe that |u| < ||. Moreover, since K does not have finite rank, we again use 
Proposition 16 to conclude that 42#0. We argue inductively to choose an orthogonal 
sequence of unit vectors in H, {;,}, and a sequence of nonzero real numbers {yu,;} such that 
for each index k, 


K(i) = mathe and |ag| = sup {\(K(h), #)| | he SA[spaniya,...tall}. (24) 


Observe that {|x|} is decreasing. We claim that {u,} > 0. Indeed, otherwise, since this 
sequence is decreasing, there is some € > 0 such that |u;| > ¢ for all k. Therefore, for natural 
numbers m and n, since yw, is orthogonal to Pm, 


WK(Yn) — K(Wm U7 = wlll? + we, llrall? = 22. 


Thus {K(;)} has no strongly convergent subsequence and this contradicts the compactness 
of the operator K. Therefore {uu} — 0. Define Hp to be the closed linear span of {Wi dpey: 
Then, by Proposition 10, {Wx }f2, 18 an orthonormal basis for Ho. Since K( Ho) C Ab, it follows 
from Proposition 17 that K( Hp) C Hj. But observe that if h ¢ Hj is a unit vector, then, 
for each k, h € SM [span {1, ..., we_-1}]+ and therefore |(K(h), h)| < |ux|. Since {uz} > 0, 
(K(h), h) = 0. Thus Q7 = 0 on Hy and hence, by the polarization identity, ker K = Hy. 
Thus Ho = [ker K]+. O 


In case a symmetric operator T € C( H) has finite rank, define Hp to be the image of T. Then 
ker T = Hp. The above argument establishes a finite orthonormal basis for Hp consisting of 
eigenvectors of T, thereby recovered a basic result of linear algebra that was mentioned at 
the beginning of this section. 


PROBLEMS 
51. Let H be a Hilbert space and T € L( H) be compact and symmetric. Define 


a= inf (T(h),h)andB= sup (T(h), h). 
est Ir=1 


Show that if a < 0, then @ is an eigenvalue of T and if B > 0, then B is an eigenvalue of T. 
Exhibit an example where a = 0 and yet a is not an eigenvalue of T, that is, T is one-to-one. 
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52. Let H be a Hilbert space and K ¢ £(H) be compact and symmetric. Suppose 


sup yryai (K(h), h)=B> 0. 


Let {h,} be a sequence of unit vectors for which lim,-,.0(K(hn), hn) = B. Show that a sub- 
sequence of {h,,} converges strongly to an eigenvector of T with corresponding eigenvalue B. 


16.7 THE RIESZ-SCHAUDER THEOREM: CHARACTERIZATION 
OF FREDHOLM OPERATORS 


A subspace Xo of the Banach space X is said to be of finite codimension in X provided Xo has 
a finite dimensional linear complement in X, that is, there is a finite dimensional subspace 
X, of X for which X = Xo ® X,. The codimension of Xp, denoted by codim Xo, is properly 
defined to be the dimension of a linear complement of Xo; all linear complements have the 
same dimension (see Problem 66). A cornerstone of linear algebra is the assertion that if X 
is a finite dimensional linear space and T: X > X is linear, then the sum of the rank of T 
and the nullity of T equals the dimension of X, that is, if dim X = n, | 


dimIm T + dim ker T = n, 
and therefore, since codim Im T =n — dim Im T, 
dim ker T = codim Im T. | (25) 


Our principal goal in this section is to prove that if H is a Hilbert space and the operator 
T € £(H) isa compact perturbation of the identity operator, then T has a finite dimensional 
kernel and a finite codimensional image for which (25) holds. 


Proposition 20 Let H be a Hilbert space and K € £(H) be compact. Then Id +K has finite 
dimensional kernel and a closed image. 


Proof Suppose ker (Id +K) is infinite dimensional. We infer from Proposition 11 that there 
is an orthogonal sequence of unit vectors {u,} contained in ker (Id+K). Since ||K(un) — 
K(um )|| = |lUn —Um|| = J/2, if m én, the sequence {K(u, )} has no convergent subsequence. 
This contradicts the compactness of the operator K. Thus dim[ker (Id+K)] < oo. Let 
Ho = [ker (Id+K)]+. We claim that there is a c > 0 for which 


lu + K(u)|| = cllul| for all u € Ap. (26) 


Indeed, if there is no such c, then we can choose a sequence {h,} of unit vectors in Hp such 
that {u, + K(u,)} > 0 strongly in H. Since K is compact, by passing to a subsequence if 
necessary, we may suppose that {K(u,)} — ho strongly. Therefore {u,} —- —ho strongly. 
By the continuity of K, ho + K(ho) = 0. Thus ho is a unit vector that belongs to both 
[ker (Id+K)]*+ and ker (Id+K). This contradiction confirms the existence of a c > 0 for 
which (26) holds. We infer from (26) and the completeness of Ho that (Id+K)( Ho) is closed. 
Since (Id +K)( Ho) = = (Id+K)(H), Im(Id+X) is closed. O 
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Definition Let {¢,} be an orthonormal basis for the separable Hilbert space H. For each n, 
define P, € L(H) by 


n 


Pr(h) = > (px, hoy for allh € H. 
k=1 


We call {P,,} the orthogonal projection sequence induced by {y,}. 


For an orthogonal projection sequence {P,} induced by an orthonormal basis {9,}, 
each P, is the orthogonal projection of H into span {;,...,@,} and therefore || P,|| = 1. 
Moreover, by the very definition of an orthonormal basis, {P,} > Id pointwise on H. 
Therefore, for any T € L(H), {P, 0 T} is a sequence of operators of finite rank that 
converges pointwise to T on H. 


Proposition 21 Let {P,} be the orthogonal projection sequence induced by the orthonormal 
basis {~n} for the separable Hilbert space H. Then an operator T € L( H) is compact if and 
only if {P, 0 T} > T in L(B). 


Proof First assume {P, 0 T} > T in £(H). For each natural number n, P, o T has finite 
dimensional range and therefore ( P,, o T)(B) is totally bounded, where B is the unit ball in 
H. Since {P, o T} > T € £L(H), {P, o T} converges uniformly on B to T. Therefore T(B) 
also is totally bounded. We infer from Proposition 18 that the operator T is compact. To 
prove the converse, assume T is compact. Then the set T(B) is compact with respect to the 


strong topology. For each natural number n, define ¢,: T(B) > R by 


Wn(h) = ||Pa(h) — All for all h € T(B). 


Since each P, has norm 1, the sequence of real-valued functions {ip : T(B) > R} is 
equicontinuous, bounded, and converges pointwise to 0 on the compact set T(B). We infer 
from the Arzela-Ascoli Theorem that {f,: T(B) — R} converges uniformly to 0. This 


means precisely that {P, o T} > Tin £L(H). LI 


Proposition 22 Let H be a Hilbert space and K € £L(H) be compact. If Id +K is one-to-one, 
then it is onto. 


Proof We leave it as an exercise (Problem 53) to show that there is a closed separable 
subspace Ho of H for which K( Ho) C Ho and K = 0 on Hj. Therefore, by replacing H by 
Ho we may suppose H is separable. We argue as we did in the proof of Proposition 20 to 
show that there is a c > 0 for which 


[2 + K(h)|| = cllhl| for all h € H. (27) 


According to Theorem 11, H has an orthonormal basis {g,}. Let {P,} be the orthogonal 
projection sequence induced by {¢,,}. For each natural number n, let H, be the linear Span 
of {~1,..., @n}. Since the operator K is compact, according to the preceding proposition, 
{P,oK} — Kin £(H). Choose a natural number N for which || P, 0K — K'| <c/2 for alln > N. 
We infer from (27) that 


lu + Py o K(u)|| > c/2||ul| for all u € H andalln > N. (28) 


Section 16.7 The Riesz-Schauder Theorem: Characterization of Fredholm Operators 331 


To show that (Id+K)(H) = H, let h, belong to H. Let n > N. It follows from (28) that 
the restriction to H, of Id+P, o K is a one-to-one linear operator that maps the finite 
dimensional space H,, into itself. A one-to-one linear operator on a finite dimensional space 
is onto. Therefore this restriction maps H, onto H,,. Thus, there is a vector u,, € H,, for which 


Un + (Pao K)(un) = Py(hx). | (29) 


Take the inner product of each side of this equality with v € H and use the symmetry of the 
projection P,, to conclude that 


(un + K(Un), Pn(v)) = (he, Py(v)) for alln > N,v € H. (30) 
We infer from (29) and the estimate (28) that 
sll = Pa(he)ll = ltn + (Pao Kall > c/2llunll for alln > N. 


Therefore the sequence {u,,} is bounded. Theorem 6 tells us that there is a subsequence {hn,} 
that converges weakly toh € H. Therefore {h,, + K(hn,)} converges weakly toh +K (h). 
Take the limit as k — oo in (30) with n = n; to conclude, by Proposition 7, that 


| (u+K(u), v) = (hy, v) for all v € H. 
Therefore u + K(u) =h,. Thus (Id+K)(H)=H _ Oo 


The Riesz-Schauder Theorem Let H be a Hilbert space and K € cn ) be compact. Then 
Im(Id+K ) is closed and 
dim ker (Id +K) = dimker (Id+K*)<oo. (61) 


In particular, 1d+K is one-to-one if and only if it is onto. 


Proof According to Proposition 20, a compact perturbation of the identity has finite 
dimensional kernel and a closed image. We will show that 


dim ker (Id+K) > dim ker (Id+K*). _ (32) 


Once this is established, we replace K by K* and use the observation that (K*)* = K, together 
with Schauder’s Theorem regarding the compactness of K*, to obtain the inequality in the 
opposite direction. We argue by contradiction to verify (32). Otherwise, dim ker (Id+K) < 
dim ker (Id+K*). Let P be the orthogonal projection of H onto ker(Id+K) and A a 
linear mapping of ker (Id+K) into ker (Id+K*) that is one-to-one but not onto. Define 
K’ = K + Ao P. Since Id +K has closed image, Proposition 13 tells us that 


H = Im(Id+K) + ker (Id +K*) 


and therefore Id+K’ is one-to-one but not onto. On the other hand, since A o P is of finite 
rank, it is compact and therefore so is K’. These two assertions contradict the preceding 
proposition. Therefore (32) is established. Since Id +K has closed image, we infer from (14) 
and (32) that Id+K is one-to-one if and only if it is onto. O 
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Corollary 23 (the Fredholm Alternative) Let H be a Hilbert space, K « £L(H) compact, and 
#. a nonzero real number. Then exactly one of the following holds: 


(i) There is a nonzero solution of the following equation 
ph—K(h)=0,he dH. 

(ti) For every ho € H, there is a unique solution of the equation 
ph—K(h)=ho,he dH. 


Definition Let H be a Hilbert space and T belong to £L( H). Then T is said to be Fredholm 
provided the kernel of T is finite dimensional and the image of T has finite codimension. For 
such an operator, its index, ind T, is defined by 


ind T = dim kerT — codim ImT. 


In the proof of the Riesz-Schauder Theorem, we first established that Im T is closed and 
used this, together with (14), to show that Im T has finite codimension equal to dim ker T*. 
However, Theorem 12 of Chapter 13 tells us that if H is a Hilbert space and the operator 
T € £L(H) has a finite codimensional image, then its image is closed. Therefore each 
Fredholm operator has a closed image and hence, again by (14), codim Im T = dim ker 7*. 

We say that an operator T € L( #) is invertible provided it is one-to-one and onto. 
The Open Mapping Theorem tells us that the inverse of an invertible operator is continuous 
and therefore an invertible operator is an isomorphism. 


Theorem 24 Let H be a Hilbert space and T belong to L( H). Then T is Fredholm of index 0 
if and only if T = S + K, where S € L(H) is invertible and K € L( H) is compact. 


Proof First assume T is Fredholm of index 0. Since Im T is closed, Proposition 13 tells us that 
H =ImT ®@ ker 7™. (33) 


Since dim ker T = dim ker T* < 00, we may choose a one-to-one linear operator A of ker T 
onto ker 7*. Let P be the orthogonal projection of H onto ker T. Define K = Ao Pe £L(H) 
and S = T — K. Then T = $+ K. The operator K is compact since it is of finite rank, 
while the operator S is invertible by (33) and the choice of P and A. Hence T is a compact 
perturbation of a invertible operator. 


To prove the converse, suppose T = $+ K,where S € £(H) isinvertible and K € £(H) 
is compact. Observe that 
T = So[Id+S7! o K]. (34) 


Since S~! is continuous and K is compact, S~! o K is compact. The Riesz-Schauder Theorem 
tells us that Id+S—! o K is Fredholm of index 0. The composition of a Fredholm operator 
with an invertible operator is also Fredholm of index 0 (see Problem 55). We therefore infer 
from (34) that T is Fredholm of index 0. O 
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We leave it as an exercise to establish the following corollary. 


Corollary 25 Let H bea Hilbert space and T and S in L(A) be Fredholm of index 0. Then 
the composition S o T is also Fredholm of index 0. | 


Remark The Riesz-Schauder Theorem and Theorem 24 are true for operators on a general 
Banach space. However, the general method of proof must be different. An essential ingredient 
in the proof of Proposition 22 is the approximation in L( H) of a compact operator by an oper- 
ator of finite rank. Per Enflo has shown that there are linear compact operators on a separable 
Banach space that cannot be approximated in L( H) by linear operators of finite rank. 


53. 


54. 


55. 


56. 
57. 


58. 


59. 


60. 


61. 


PROBLEMS 


Let K € £L(H) be compact. Show that T = K*K is compact and symmetric. Then use the 
Hilbert-Schmidt Theorem to show that there is an orthonormal sequence {g;} of H such that 
T (gx) = Ange for all k and T(h) = 0 if h is orthogonal to {g,}°,. Conclude that if h is 
orthogonal to {g,}7°,, then | 


IK(h)I? = (K(h), K(h)) = (T(h), h) = 0. 
Define Hp to be the closed linear span of {K”(g;)|m > 1,k > 1}. Show that Ho is closed and 


separable, K( Hy) C Ho and K = 0 on Hy. 


Let K( #7) denote the set of compact operators in £( H). Show that K(H) is a closed subspace 
of £( #1) that has the set of operators of finite rank as a dense subspace. Is K(H) an open 
subset of £(H)? 


Show that the composition of a Fredholm operator of index 0 with an invertible operator. 1S 
also Fredholm of index 0. 


Show that the composition of two Fredholm operators of index 0 is also Fredholm of index 0. 


Show that an operator T € £( 7) is Fredholm of index 0 if and only if it is the perturbation 
of an invertible operator by an operator of finite rank. | 


Argue as follows to show that the collection of invertible operators in L(H ) is an open subset 
of £( 7). 


(i) ForA e€L(H ) with || Al| <1, use the completeness of £( HH) to show that the so-called 
Neumann series }'° , A” converges to an operator in £( H) that is the inverse of Id —A. 


(ii) For an invertible operator S € £(H) show that for any T € £(H), T = S|Id 48-17 — 
S)]. 

(iii) Use (i) and (ii) to show that if S € £L(H) is invertible then so is any T € £(H) for which 
|S— TI <1/)S~'I. 

Show that the set of operators in £(H) that are Fredholm of index 0 is an open subset of 

L(H). | 

By following the orthogonal approximation sequence method used in the proof of Proposi- 

tion 22, provide another proof of Proposition 14 in case H is separable. 


For T € £L(H), suppose that (T(h), h) > ||h||? for allh € H. Assume that K € L(H) is 
compact and T + K is one-to-one. Show that T + K is onto. 


3A counterexample to the approximation problem in Banach spaces,” Acta Mathematica, 130, 1973. 
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62. 
63. 


64. 


65. 


Let K € £(H) be compact and p € R have || > ||K||. Show that p — K is invertible. 


Let S ¢ £(H) have ||S|| <1, K € £(H) be compact and (Id+5 + K)(H) = H. Show that 

Id+S + K is one-to-one. 

Let GL( H) denote the set of invertible operators in £(H). 

(i) Show that under the operation of composition of operators, GL(H) is a group: it is called 
the general linear group of H. 


(ii) An operator T in GL(H) is said be orthogonal, provided that T* = T~!. Show that the 
set of orthogonal operators is a subgroup of GL( H): it is called the orthogonal group. 


Let H be a Hilbert space, T ¢ £(H) be Fredholm of index zero, and K € £( H) be compact. 
Show that T + K is Fredholm of index zero. 


. Let X be a finite codimensional subspace of a Banach space X. Show that all finite dimensional 


linear complements of Xq in X have the same dimension. 
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The first goal of the present chapter is to abstract the most important properties of Lebesgue 
measure on the real line in the absence of any topology. We shall do this by giving 
certain axioms that Lebesgue measure satisfies and base our theory on these axioms. As a 
consequence our theory will be valid for every system satisfying the given axioms. 

To establish that Lebesgue measure on the real line is a countably additive set function 
on a o-algebra we employed only the most rudimentary set-theoretic concepts. We defined 
a primitive set function by assigning length to each bounded interval, extended this set 
function to the set function outer measure defined for every subset of real numbers, and then 
distinguished a collection of measurable sets. We proved that the collection of measurable 
sets is a o-algebra on which the restriction of outer measure is a measure. We call this the 
Carathéodory construction of Lebesgue measure. The second goal of this chapter is to show 
that the Carathéodory construction is feasible for a general abstract set X. Indeed, we show 
that any nonnegative set function w defined on a collection S of subsets of X induces an 
outer measure p* with respect to which we can identify a o-algebra M of measurable sets. 
The restriction of 4* to M is a measure that we call the Carathéodory measure induced by 
p. We conclude the chapter with a proof of the Carathéodory-Hahn Theorem, which tells us 
of very general conditions under which the Carathéodory measure induced by a set function 
jis an extension of p. 


17.1 MEASURES AND MEASURABLE SETS 


Recall that a o-algebra of subsets of a set X is a collection of subsets of X that contains the 
empty-set and is closed with respect to the formation of complements in X and with respect 
to the formation of countable unions and therefore, by De Morgan’s Identities, with respect 
to the formation of intersections. By a set function we mean a function that assigns an 
extended real number to certain sets. 
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Definition By a measurable space we mean a couple (X,.M) consisting of a set X and a 
a-algebra M of subsets of X. A subset E of X is called measurable (or measurable with respect 
to M) provided E belongs to M. 


Definition By a measure yz on a measurable space (X, M) we mean an extended real-valued 
nonnegative set function ~: M — [0, oo] for which u(@) = 0 and which is countably additive 
in the sense that for any countable disjoint collection {Ex}°2., of measurable sets, 


(0 | = s L( Ex). 
k=! ial 


By a measure space (X, M, 4.) we mean a measurable space (X, M) together with a measure 
pt defined on M. 


One example of a measure space is (R, £, m), where R is the set of real numbers, £ the 
collection of Lebesgue measurable sets of real numbers, and m Lebesgue measure. A second 
example of a measure space is (R, B, m), where B is the collection of Borel sets of real 
numbers and m is again Lebesgue measure. For any set X, we define M = 2*, the collection 
of all subsets of X, and define a measure 7 by defining the measure of a finite set to be the 
number of elements in the set and the measure of an infinite set to be oo. We call 7 the 
counting measure on X. For any o-algebra M of subsets of a set X and point xo belonging to 
X, the Dirac measure concentrated at xo, denoted by dx, assigns 1 to a set in M that contains 
xo and 0 to a set that does not contain xo: this defines the Dirac measure space (X, M, 6,,). 
A slightly bizarre example is the following: let X be any uncountable set and C the collection 
of those subsets of X that are either countable or the complement of a countable set. Then 
C is a o-algebra and we can define a measure on it by setting (A) = 0 for each countable 
subset of X and y(B) = 1 for each subset of X whose complement in X is countable. Then 
(X, C, ) is a measure space. 

It is useful to observe that for any measure space (X, M, 2), if Xo belongs to M, then 
(Xo, Mo, Ho) is also a measure space where Mg is the collection of subsets of M that are 
contained in Xo and pg is the restriction of p to Mo. 


Proposition 1 Let (X, M, «) be a measure space. 
(Finite Additivity) For any finite disjoint collection { Ex}y_, of measurable sets, 


(U a = > H( Ex). 
k=1 k=1 
(Monotonicity) If A and B are measurable sets and A C B, then 
u(A) < p(B). 
(Excision) If, moreover, A C B and p( A) < ov, then 
u(B~ A) =p(B)—p(A), 


so that if u( A) = 0, then 
u(B~ A) = p(B). 
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(Countable Monotonicity) For any countable collection {Ex}7°, of measurable sets 
that covers a measurable set E, 


OO 
w(E) < ¥ p(x). 
k=1 
Proof Finite additivity follows from countable additivity by setting E, = 9, so that 
HE; ) = 0, for k > n. By finite additivity, 


u(B) = p(A)+p(B~ A), 


which immediately implies monotonicity and excision. To verify countable monotonicity, 
define G; = E, and then define 


G,=E.~ 


k-1 
\_) E;| for all k > 2. 
i=1 


Observe that 
o @) o @) 
{Gi}72, is disjoint, (_) G, = (_J Ex and G; C E; for all k. 
k=1 k=l 


From the monotonicity and countable additivity of yz we infer that 


oe) <0(0 a)=4(0 ox) = Salon) = $ ale. : 
k=l k=l i=l i=l 


The countable monotonicity property is an amalgamation of countable additivity and 
monotonicity, which we name since it is invoked so frequently. 

A sequence of sets {E,}7°, is called ascending provided for each k, E, C Ex41, and 
said to be descending provided for each k, Ex, C Ex. 


Proposition 2 (Continuity of Measure) Let (X, M, 2) be a measure space. 


(i) If {Ax}7°, is an ascending sequence of measurable sets, then 


10. @) 
(0 a] = lim p(Ax). (1) 
k=l k—> oo 
(it) If {By}°, is a descending sequence of measurable sets for which p(B) < 00, then 
10. @) 
we () By} = lim p(B). (2) 
k=1 k-> 00 


The proof of the continuity of measure is the same, word for word, as the proof of the 
continuity of Lebesgue measure on the real line; see page 44. 

For a measure space (X, M, yw) and a measurable subset E of X, we say that a property 
holds almost everywhere on E, or it holds for almost all x in E, provided it holds on E~ Eo, 
where Eo is a measurable subset of E for which y( Eo) = 0. 
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The Borel-Cantelli Lemma Let (X, M, 2) be a measure space and {E;}7°, a countable 
OO 

collection of measurable sets for which > p( E;,) < 00. Then almost all x in X belong to at 
k=1 


most a finite number of the E;’s. 


oO 
Proof For each n, by the countable monotonicity of wu, u(U;7o, Ex) < S (Ex). Hence, by 
k=n 
the continuity of p, 


(A 


n=1 


00 
Ue 
k=n 


OO oO 
= lim w(( Ex) < lim >) w( Ex) = 0. 
n—> oo NOOO 7, 


k=n 


Observe that)" , UZ, Ex| is the set of all points in X that belong to an infinite number of 
the E;’s. L] 


Definition Let (X, M,) be a measure space. The measure p is called finite provided 
p(X) < oo. Itis called o-finite provided X is the union of a countable collection of measurable 
sets, each of which has finite measure. A measurable set E is said to be of finite measure 
provided p( E) < oo, and said to be o-finite provided E is the union of a countable collection 
of measurable sets, each of which has finite measure. 


Regarding the criterion for o-finiteness, the countable cover by sets of finite measure 
may be taken to be disjoint. Indeed, if {X;,}7°, is such a cover replace, for k > 2, each X;, 
by X,~ Urry} X; to obtain a disjoint cover by sets of finite measure. Lebesgue measure on 
[0, 1] is an example of a finite measure, while Lebesgue measure on (—00, oo) is an example 
of a o-finite measure. The counting measure on an uncountable set is not o-finite. 

Many familiar properties of Lebesgue measure on the real line and Lebesgue integration 
for functions of a single real variable hold for arbitrary o-finite measures, and many 
treatments of abstract measure theory limit themselves to o-finite measures. However, many 
parts of the general theory do not require the assumption of o-finiteness, and it seems 
undesirable to have a development that is unnecessarily restrictive. 


Definition A measure space (X, M, 1) is said to be complete provided M contains all 
subsets of sets of measure zero, that is, if E belongs to M and p( E) = 0, then every subset of 
E also belongs to M. 


We proved that Lebesgue measure on the real line is complete. Moreover, we also 
showed that the Cantor set, a Borel set of Lebesgue measure zero, contains a subset that 
is not Borel; see page 52. Thus Lebesgue measure on the real line, when restricted to the 
o-algebra of Borel sets, is not complete. The following proposition, whose proof is left to the 
reader (Problem 9), tells us that each measure space can be completed. The measure space 
(X, Mo, uo) described in this proposition is called the completion of (X, M, 2). 


Proposition 3 Let (X, M, ) be a measure space. Define Mo to be the collection of subsets 
E of X of the form E = AU B where B € Mand ACC for some C € M for which p(C) = 0. 
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For such a set E define po(E) = p(B). Then Mo is a o-algebra that contains M, po is a 
measure that extends p, and (X, Mo, wo) is a complete measure space. 


PROBLEMS 


1. Let f be a nonnegative Lebesgue measurable function on R. For each Lebesgue measurable 
subset E of R, define u(E) = f, f, the Lebesgue integral of f over E. Show that p is a 
measure on the c-algebra of Lebesgue measurable subsets of R. 


2. Let M be a a-algebra of subsets of a set X and the set function 4: M — [0, oo) be finitely 
additive. Prove that yz is a measure if and only if whenever {A;}7°., is an ascending sequence 
of sets in M, then 


(0 a), = jim n p2( Ax). 


k=1 
3. Let M be a o-algebra of subsets of a set X. Formulate and establish a correspondent of the 
preceding problem for descending sequences of sets in M. 


4. Let {(Xy, My, Ma) }aca be a collection of measure spaces parametrized by the set A. Assume 
the collection of sets {X)},¢, is disjoint. Then we can form a new measure space (called their 
union) (X, B, yw) by letting X = Uyea Xa, B be the collection of subsets B of X such that 
BOX, € My for all A € A and defining u(B) = > palBNX,] for Be B. 

AEA 
(i) Show that M is a o-algebra. 
(ii) Show that yp is a measure. 
(iii) Show that yz is o-finite if and only if all but a countable number of the measures jz, have 
u(X,) = 0 and the remainder are o-finite. 


5. Let (X, M, «) be a measure space. The symmetric difference, FE; A E2, of two subsets Ey 
and E> of X is defined by 


E, AE) = [E, ~ Ep] U[E2 ~ E)]. 


(i) Show that if Z; and Ez are measurable and px( E; A E) = 0, then w( E,) = p( E2). 
(ii) Show that if 4 is complete, E; € M and E2 ~ E; € M, then E, € M if u( EF, AE2) =0. 
6. Let (X, M, 2) be a measure space and Xo belong to M. Define Mo to be the collection of 


sets in M that are subsets of Xq and jp the restriction of 4 to Mo. Show that (Xo, Mo, to) is 
a measure space. 


7. Let (X, M) be a measurable space. Verify the following: 


(i) If and v are measures defined on M, then the set function A defined on M by 
\(E) = p(E) + v(E) also is a measure. We denote A by w+ v. 


(11) If ~ and v are measures on M and yp > », then there is a measure A on M for which 
L=v +A. 


(iii) If v is o-finite, the measure A in (ii) is unique. 


(iv) Show that in general the measure A need not be unique but that there is always a smallest 
such A. 
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8. Let (X, M, 1) be a measure space. The measure p is said to be semifinite provided each 
measurable set of infinite measure contains measurable sets of arbitrarily large finite measure. 


(i) Show that each o-finite measure is semifinite. 

(ii) For E ¢ M, define 4 (E) = u(E) if u(E) < oo, and if u(E) = 00 define 4;(E) = © if 
E contains measurable sets of arbitrarily large finite measure and 1(E) = 0 otherwise. 
Show that 2; is a semifinite measure: it is called the semifinite part of y. 


(iii) Find a measure jz on M that only takes the values 0 and 00 and p = py + pp. 


9. Prove Proposition 3, that is, show that Mg is a a-algebra, uo is properly defined, and 
(X, Mo, uo) is complete. In what sense is Mp minimal? 


10. If (X, M, ) is a measure space, we say that a subset E of X is locally measurable provided 
for each B € M with p(B) <n, the intersection EN B belongs to M. The measure p is called 
saturated provided every locally measurable set is measurable. 


(i) Show that each o-finite measure is saturated. 
(ii) Show that the collection C of locally measurable sets is a c-algebra. 


(iii) Let (X, M, w) be a measure space and C the a-algebra of locally measurable sets. For 
E €C, define 2(E) = p(E£) if E € M and @( E) = 00 if E ¢ M. Show that (X, C, 72) is 
a Saturated measure space. 

(iv) If w is semifinite and E € C, set u(E) = sup {u(B)| B ¢ M, BC E}. Show that (X, C, p) 
is a saturated measure space and that yz is an extension of yu. Give an example to show 
that 2 and pu may be different. 7 


11. Let » and 7 be measures on the measurable space (X, M). For E € M, define v(E) = 
max{u(E), n( £)}. Isv a measure on (X, M)? 


17.2 SIGNED MEASURES: THE HAHN AND JORDAN DECOMPOSITIONS 


Observe that if 4; and 2 are two measures defined on the same measurable space (X, M), 
then, for positive numbers a and B, we may define a new measure pz on X by setting 


u3(E) = a- wy (E)+B- po(E) for all Ein M. 


It turns out to be important to consider set functions that are linear combinations of measures 
but with coefficients that may be negative. What happens if we try to define a set function p 
on M by 


v(E) = w1(E) — w2(E) for all E in M? 
The first thing that may occur is that v is not always nonnegative. Moreover, v(£) is not 


even defined for E ¢ M such that w;(E) = uo(E) = oo. With these considerations in mind 
we make the following definition. 


Definition By a signed measure v on the measurable space (X, M) we mean an extended 
real-valued set function v: M — [—o0, oo] that possesses the following properties: 


(i) v assumes at most one of the values +00, —oo. 
(ii) v(B) =0. 
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(i) For any countable collection {E k}pe1 Of disjoint measurable sets, 
10. @) 10. @) 
v(\L)J Ex) = SY) v( Ex), 
k=1 k=1 
10. @) 
where the series ~~, v( Ex) converges absolutely if v(\J Ex) is finite. 
k=1 


A measure is a special case of a signed measure. It is not difficult to see that 
the difference of two measures, one of which is finite, is a signed measure. In fact, the 
forthcoming Jordan Decomposition Theorem will tell us that every signed measure is the 
difference of two such measures. 

Let v be a signed measure. We say that a set A is positive (with respect to v) provided 
A is measurable and for every measurable subset E of A we have v(E) > 0. The restriction 
of v to the measurable subsets of a positive set is a measure. Similarly, a set B is called 
negative (with respect to v) provided it is measurable and every measurable subset of B has 
nonpositive v measure. The restriction of —v to the measurable subsets of a negative set also 
is a measure. A measurable set is called null with respect to v provided every measurable 
subset of it has vy measure zero. The reader should carefully note the distinction between a 
null set and a set of measure zero: While every null set must have measure zero, a set of 
measure Zero may well be a union of two sets whose measures are not zero but are negatives 
of each other. By the monotonicity property of measures, a set is null with respect to a 
measure if and only if it has measure zero. Since a signed measure v does not take the values 
oo and —oo, for A and B measurable sets, 


if A C B and |v( B)| < oo, then |»(A)| < 00. (3) 


Proposition 4 Let v be a signed measure on the measurable space (X, M). Then every 
measurable subset of a positive set is itself positive and the union of a countable collection of 
positive sets is positive. 


Proof The first statement is trivially true by the definition of a positive set. To prove the 
second statement, let A be the union of a countable collection {A;}°, of positive sets. Let E 
be a measurable subset of A. Define E; = EM A. For k > 2, define 


Ey = [EN Ay] ~[A, U...U Ag_y]. 


Then each E; is a measurable subset of the positive set A, and therefore v( E;,) > 0. Since E 
is the union of the countable disjoint collection {Ex}, 


OO 
v(E)= S v( Ex) = 0. 
k=1 
Thus A is a positive set. O 
Hahn’s Lemma _ Let v be a signed measure on the measurable space (X, M) and E a 


measurable set for which 0 < v( E) < oo. Then there is a measurable subset A of E that is 
positive and of positive measure. 
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Proof If E£ itself is a positive set, then the proof is complete. Otherwise, E contains sets of 
negative measure. Let m, be the smallest natural number for which there is a measurable set 
of measure less than —1/m ,. Choose a measurable set E, C E with v( E,) < —1/m,. Let n 
be a natural number for which natural numbers m,,...,m, and measurable sets E),..., Ey, 
have been chosen such that, for 1 < k <n, my, is the smallest natural number for which there 
is a measurable subset of E ~ Ui E; of measure less than —1/m, and E;, is a subset of 


[E~ Ulz; Ej] for which v( Ex) < —1/me. 
If this selection process terminates, then the proof is complete. Otherwise, define 


1, @) 
A=E~\) &, sothatE=AU 
k=1 


0O 
lJ a is a disjoint decomposition of E. 
k=1 

Since 7°, Ex is a measurable subset of E and |v(E)| < oo, by (3) and the countable 
additivity of v, 


—o~ < ie & = s v( Ex) < s —1/mx. 
k=1 k=1 


k=1 
Thus limy-,o.m; = oo. We claim that A is a positive set. Indeed, if B is a measurable subset 
of A, then, for each k, 
k-1 
BCACE~ || J&Ej/, 
j=l 
and so, by the minimal choice of m;, v(B) > —1/(m,z —1). Since limy_, 99 my = 00, we have 
v(B) > 0. Thus A is a positive set. It remains only to show that v(A) > 0. But this follows 


from the finite additivity of v since v(E) >O and v(E~ A) = »(U®, Ex) = S (Ex) <0.0 
k=1 


The Hahn Decomposition Theorem Let v be a signed measure on the measurable space 
(X, M). Then there is a positive set A for v and a negative set B for v for which 


X=AUBand ANB=¥9%. 


Proof Without loss of generality we assume +00 is the infinite value omitted by v. Let P be 
the collection of positive subsets of X and define A = sup {v(E)| E € P}. Then A > 0 since 
P contains the empty set. Let {A,}7°2, be a countable collection of positive sets for which 
A = limy_,o0 v( Ay). Define A = UZ? Ax. By Proposition 4, the set A is itself a positive set, 
and so A > v(A). On the other hand, for each k, A~ A; C A and so v(A~ Ax) > 0. Thus 


v(A) = v(Ay) + v(A~ Ag) = v(Ax). 
Hence v(A) > A. Therefore v( A) = A, and A < oo since v does not take the value oo. 


Let B = X~A. We argue by contradiction to show that B is negative. Assume B is 
not negative. Then there is a subset E of B with positive measure and therefore, by Hahn’s 
Lemma, a subset Eo of B that is both positive and of positive measure. Then A U Ep is a 
positive set and 

v(AU Eg) =v(A)+v( Eo) >A, 


a contradiction to the choice of A. LJ 
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A decomposition of X into the union of two disjoint sets A and B for which A is positive 
for v and B negative is called a Hahn decomposition for v. The preceding theorem tells us of 
the existence of a Hahn decomposition for each signed measure. Such a decomposition may 
not be unique. Indeed, if {A, B} is a Hahn decomposition for v, then by excising from A a null 
set E and grafting this subset onto B we obtain another Hahn decomposition {A ~ E, BU E}. 

If {A, B} is a Hahn decomposition for v, then we define two measures v+ and v~ with 
vy=vt —v~ by setting 


v*(E) =v(ENA) andy (E£) = —v(ENB). 


Two measures ; and v2 on (X, M) are said to be mutually singular (in symbols vy; 1 v2) 
if there are disjoint measurable sets A and B with X = AU B for which p, (A) =12(B) =0. 
The measures v* and v~ defined above are mutually singular. We have thus established the 
existence part of the following proposition. The uniqueness part is left to the reader (see 
Problem 13). 


The Jordan Decomposition Theorem Let v be a signed measure on the measurable space 
(X, M). Then there are two mutually singular measures v+ and v~ on (X, M) for which 
vy =v —v™. Moreover, there is only one such pair of mutually singular measures. 


The decomposition of a signed measure v given by this theorem is called the Jordan 
decomposition of v. The measures vt and v~ are called the positive and negative parts (or 
variations) of v. Since »v assumes at most one of the values --oo and —oo, either vt or vy 
must be finite. If they are both finite, we call v a finite signed measure. The measure |p| is 
defined on M by 

|v|(E) =v*(E)+v7(E) forall Ee M. 


We leave it as an exercise to show that 


vl(X) = sup ¥ p(E)I, (4) 


k=1 


where the supremum is taken over all finite disjoint collections {E,}/_, of measurable subsets 
of X. For this reason |v|(X) is called the total variation of v and denoted by [vl var- 


Example Let f: R > R be a function that is Lebesgue integrable over R. For a Lebesgue 
measurable set E, define v(E) = f,, fdm. We infer from the countable additivity of 
integration (see page 90) that v is a signed measure on the measurable space (R, £). Define 
A= {xe R| f(x) > 0} and B = {x € R| f(x) <0} and define, for each Lebesgue measurable 
set E, 
vi(E)= fdmandv"(E)=-— f dm. 
ANE BNE 

Then {A, B} is a Hahn decomposition of R with respect to the signed measure v. Moreover, 
vy =vt —v7 is a Jordan decomposition of v. 


PROBLEMS 


12. In the above example, let E be a Lebesgue measurable set such that 0 < v(E) < oo. Finda 
positive set A containedin E for whichy(A)>0. 
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13. Let w be a measure and jz; and p2 be mutually singular measures on a measurable space 
(X, w) for which » = py — pz. Show that 2 = 0. Use this to establish the uniqueness 
assertion of the Jordan Decomposition Theorem. 


14. Show that if E is any measurable set, then 
—v(E) <v(E) <v*(E) and |»(E)| < |»|(£). 


15. Show that if y; and v2 are any two finite signed measures, then so is av; + Bv2, where a and B 
are real numbers. Show that 


lav| = |a||y| and [vy + v2] < [vi] + [val 


where v < means »( E) < p( E) for all measurable sets E. 
16. Prove (4). 


17. Let wand vbe finite signed measures. Define wp Av = ( pt+v—|~—v|) and wVv = p+y—pAv. 
(i) Show that the signed measure yz Av is smaller than y and v but larger than any other 
signed measure that is smaller than yw and v. 


(ii) Show that the signed measure py V v is larger than u and v but smaller than any other 
measure that is larger than y and v. 


(iii) If 4 and v are positive measures, show that they are mutually singular if and only if 
Av =0. 


17.3 THE CARATHEODORY MEASURE INDUCED BY AN OUTER MEASURE 


We now define the general concept of an outer measure and of measurability of a set with 
respect to an outer measure, and show that the Carathéodory strategy for the construction 
of Lebesgue measure on the real line is feasible in general. 


Definition A set function 1: S > [0, 00] defined on a collection S of subsets of a set X 
is called countably monotone provided whenever a set E € S is covered by a countable 
collection {E,}7°, of sets in S, then 


OO 


w(E) < > w( Ex). 


k=1 


As we already observed, the monotonicity and countable additivity properties of a 
measure tell us that a measure is countably monotone. If the countably monotone set 
function jz: S — [0, oo] has the property that @ belongs to S and u(@) = 0, then p is finitely 
monotone in the sense that whenever a set E € S is covered by a finite collection {E;,}7_, of 
sets in S, then 


n 
u(E) < 3) u( Ex). 
k=1 
To see this, set E, = 9 for k >n. In particular, such a set function » is monotone in the sense 
that if A and B belong to S and AC B, then u(A) < p(B). 


Definition A set function p*: 2% — [0, oo] is called an outer measure provided p*(0) = 0 
and y.* is countably monotone. 


is ee Rie ee 
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Guided by our experience in the construction of Lebesgue measure from Lebesgue 
outer measure on the real line, we follow Constantine Carathéodory and define the 
measurability of a set as follows. 


Definition For an outer measure *: 2* — [0, oo], we call a subset E of X measurable (with 
respect to 4") provided for every subset A of X, 


w*(A) =p*(ANE)+p*(AN ES). 


Since .* is finitely monotone, to show that E C X is measurable it is only necessary to 
show that 


B*(A) > w*(ANE) + p*(AN ES) for all A C X such that p*(A) <n. 
Directly from the definition we see that a subset E of X is measurable if and only if its 
complement in X is measurable and, by the monotonicity of y*, that every set of outer 
measure zero is measurable. Hereafter in this section, 4.*: 2* — [0, co] is a reference outer 
measure and measurable means measurable with respect to p*. 
Proposition 5 The union of a finite collection of measurable sets is measurable. 
Proof We first show that the union of two measurable sets is measurable. Let E; and E> 


be measurable. Let A be any subset of X. First using the measurability of £), then the 
measurability of £2, we have 


wt(A) =p*(ANE1)+p*(AN ES) 
= p*(AN E)) + u*([AN Ef] N Ez) + u*([AN ES] N ES). 
Now use the set identities 
[AN EY]N ES = AN[E, U Ey]° 


and 
[AN E\]U[AN E.N Ef] = AN[E, VU Ey], 


together with the finite monotonicity of outer measure, to obtain 
wX(A) =p*(ANE))+p*(AN ES) 
= w(ANE)) +p*([AN Ef] N Ep) + p*([AN EC]N ES) 
= pt(ANE1) +y*([AN EC]N Ep) + y*(AN[E, U EY) 


= w*(AN [Ey U E2]) + w*(A NE) U Ep]®). 


348 Chapter17 General Measure Spaces: Their Properties and Construction 


Thus E; U E> is measurable. Now let {E;}7_, be any finite collection of measurable 
sets. We prove the measurability of the union LU;_, Ex, for general n, by induction. This is 
trivial for n = 1. Suppose it is true for n — 1. Thus, since 


n n—1 
LJ Fe =|U Ek| UE, 
and the union of two measurable sets is measurable, the set U,_, Ex is measurable. LJ 


Proposition 6 Let A C X and {E,}7_, be a finite disjoint collection of measurable sets. Then 


n n 
p* anil by = > (AN E,). 
k=1 k=1 
In particular, the restriction of 1* to the collection of measurable sets is finitely additive. 


Proof The proof proceeds by induction on n. It is clearly true for n = 1, and we assume it is 
true for n — 1. Since the collection {E;}/_, is disjoint, 


n 
A 6] Ue|ne = ane, 
k=1 


n 
A nO ee] ne = ar 
k=1 


n—-1 
7, Ex}. 
k=1 

Hence by the measurability of E, and the induction assumption, we have 


n n—1 
at N)U Ex | (ante (4 N|U a 
k=1 k=1 
n-1 
= p*(AN En) + 5 w*(AN Ey) 
k=1 
= *(AN Ex). 


Proposition 7 The union of a countable collection of measurable sets is measurable. 


Proof Let E = UP, Ex, where each E;, is measurable. Since the complement in X of 
a measurable set is measurable and, by Proposition 5, the union of a finite collection 
of measurable sets is measurable, by possibly replacing each EF, with Ex, ~ Ui Ej, we 
may suppose that {E;,}?°, is disjoint. Let A be any subset of X. Fix an index n. Define 
F, =U?_, Ex. Since F, is measurable and FC D E°, we have 


wt(A) = et(AN Fy) +u*(ANF,) > w(ANF,) +B*(ANES). 
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By Proposition 6, 


w(AN F,) = >) w*(AN Ey). 
i=l 


Thus 
n 
A)= ae (AN Ex) +p*(AN ES). 


The left-hand side of this inequality is independent of n and therefore 
oo 
w(A)> Si w*(AN Ey) + p(AN ES). 
k=1 


By the countable monotonicity of outer measure we infer that 
w*(A) > w*(ANE) + u*(ANES). 
Thus E is measurable. O 


Theorem 8 Let u* be an outer measure on 2*. Then the collection M of sets that are 
measurable with respect to 1* is a o-algebra. If pt is the restriction of * to M, then (X, M, 2) 
is a complete measure space. 


Proof We already observed that the complement in X of a measurable subset of X also is 
measurable. According to Proposition 7, the union of a countable collection of measurable 
sets is measurable. Therefore M is a o-algebra. By the definition of an outer measure, 
»*(®) = 0 and therefore % is measurable and (@) = 0. To verify that jz is a measure on 
M, it remains to show it is countably additive. Since u* is countably monotone and p* is an 
extension of jz, the set function 7 is countably monotone. Therefore we only need show that 
if {Ex}? , is a disjoint collection of measurable sets, then 


“(0 B > 3 u*(Ex). (5) 
k=1 k=1 


However, y* is monotone and, by taking A = X in Proposition 7, we see that px* is additive 
over finite disjoint unions of measurable sets. Therefore, for each n, 


OO n n 
" (C s| = “(U A =D a*(Et). 
k=1 k=1 k=1 
The left-hand side of this inequality is independent of n and therefore (5) holds. O 


17.4 THE CONSTRUCTION OF OUTER MEASURES 


We constructed Lebesgue outer measure on subsets of the real line by first defining the 
primitive set function that assigns length to a bounded interval. We then defined the outer 
measure of a set to be the infimum of sums of lengths of countable collections of bounded 
intervals that cover the set. This method of construction of outer measure works in general. 
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Theorem 9 Let S be a collection of subsets of a set X and js: S — [0, oo] a set function. 
Define p*(@) = 0 and for E C X, E¥9@, define 


p*(E) = inf > (Ee), (6) 
k= 


where the infimum is taken over all countable collections {E,}?~_, of sets in S that cover E) 
Then the set function p*: 2* — [0, 00] is an outer measure called the outer measure induced 


by p. 

Proof To verify countable monotonicity, let {E;,}?°, be a collection of subsets of X that 
OO 

covers a set E. If u*( E;,) = 00 for some k, then p*(E) < > p*( E;,) = ov. Therefore we 
k=1 


may assume each EF; has finite outer measure. Let ¢« > 0. For each k, there is a countable 
collection {E;x}"°, of sets in S that covers E, and 


a € 
> B(Ein) <b (Ex) + =F 
i=] 


Then {Ejx}1<k,i<oo is a countable collection of sets in S that covers U7, Ex; and therefore 
also covers E. By the definition of outer measure, 


1<k,i<oo 
< > w*(Ex)+ > €/2! 
= D pw*(Ex) + 
k=1 
Since this holds for all € > 0, it also holds for € = 0. LI 


Definition Let S be a collection of subsets of X, w: S — [0, 00] a set function, and p* the 
outer measure induced by . The measure 2 that is the restriction of * to the a-algebra M of 
p*-measurable sets is called the Caratheodory measure induced by /. 


p*: 2* — [0, oo] 
(the induced outer measure) 


p: S > [0, 00 ph: M = (0, oo] 
(a general set function) (the induced Carathéodory measure) 


The Carathéodory Construction 


1We follow the convention that the infimum of the empty-set is co. Therefore a subset E of X that cannot be 
covered by a countable collection of sets in S has outer measure equal to oo. 
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For a collection S of subsets of X, we use S, to denote those sets that are countable 
unions of sets of S and use S,s to denote those sets that are countable intersections of sets 
in S,. Observe that if S is the collection of open integrals of real numbers, then S, is the 
collection of open subsets of R and Sz is the collection of Gs subsets of R. 

We proved that a set E of real numbers is Lebesgue measurable if and only if it 
is a subset of a Gs set G for which G~E has Lebesgue measure zero: see page 40. The 
following proposition tells us of a related property of the Carathéodory measure induced by 
a general set function. This property is a key ingredient in the proof of a number of important 
theorems, among which are the proofs of the Carathéodory-Hahn Theorem, which we prove 
in the following section, and the forthcoming theorems of Fubini and Tonelli. 


Proposition 10 Let jz: S — [0, 00] be a set function defined on a collection S of subsets of a 
set X and jt: M = [0, 00] the Carathéodory measure induced by p. Let E be a subset of X 
for which p*( E) < oo. Then there is a subset A of X for which 


AéSgs, EC Aand p*(E) = p"(A). 
Furthermore, if E and each set in S is measurable with respect to *, then so is A and 
B(A~ E) =0. 
Proof Let « > 0. We claim that there is a set A. for which 
Ac € Sg, EC A, and p*( A.) < p*(E) +. (7) 


Indeed, since .*( E) < 00, there is a cover of E by a collection {E,}° , of sets in S for which 


OO 


Sw Ex) <u(E) +e 
k=1 


Define Ac = UX, Ex. Then A, belongs to S, and E C A,. Furthermore, since {E,}?°, is a 
countable collection of sets in S that covers A,., by the definition of the outer measure p*, 


oO 
< 2m (Ex) < p*(E) +e. 


Thus (7) holds for this choice of A¢. 


Define A =), A1/z- Then A belongs to Sgg and E is a subset of A since E is a subset 
of each A;/,. Moreover, by the monotonicity of 4* and the estimate (7), 


1 
w'(E) <w(A) <p(Arje) SHE) + = for all k. 


Thus p*(E) = p*(A). 

Now assume that E is «*-measurable and each set in S is x*-measurable. Since the 
measurable sets are a o-algebra, the set A is measurable. But yu* is an extension of the 
measure p. Therefore, by the excision property of measure, 


yi(A~ E) =p(A) —f(E) = p*(A)—p*(E) =0. O 
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PROBLEMS 


18. Let p*: 2* — [0, oo] be an outer measure. Let A C X, {Ex}, be a disjoint countable 
collection of measurable sets and E = Ure, Ex. Show that 


19. Show that any measure that is induced by an outer measure is complete. 


20. Let X be any set. Define n: 2* — [0, oo] by defining (0) = 0 and for EC X, E#9, defining 
7( E) = oo. Show that 7 is an outer measure. Also show that the set function that assigns 0 to 
every subset of X is an outer measure. 


21. Let X be aset, S = {9, X} and define (0) = 0, u(X) = 1. Determine the outer measure p* 
induced by the set function : S -> [0, 00) and the a-algebra of measurable sets. 


22, On the collection S = {9, [1, 2]} of subsets of R, define the set function : S > [0, 00) as 
follows: u(@) = 0, u([1, 2]) = 1. Determine the outer measure y* induced by and the 
o-algebra of measurable sets. 


23. On the collection S of all subsets of R, define the set function 1: S > R by setting (A) 
to be the number of integers in A. Determine the outer measure z* induced by uw and the 
o-algebra of measurable sets. 


24. Let S be a collection of subsets of X and yz: S -> [0, co] a set function. Is every set in S 
measurable with respect to the outer measure induced by 1? 


17.5 THE CARATHEODORY-HAHN THEOREM: THE EXTENSION OF A PREMEASURE 
TO A MEASURE 


Let uw: S — [0, oo] be a set function that is defined on a nonempty collection S of subsets 
of a set X. We ask the following question: What properties must the collection S and set 
function possess in order that the Carathéodory measure ji induced by p be an extension 
of 2: that is, every set E in S is measurable with respect to the outer measure p* induced by 
» and, moreover, u(E) = u*(E)? We will identify necessary properties that the set function 
#. must possess for this to be so and show that these same properties are sufficient, provided 
the collection S has finer set-theoretic structure. 

We call a set function w: S — [0, oo] finitely additive provided whenever {Ex};_1 18 a 
finite disjoint collection of sets in S and Ur_1 Ex also belongs to S, then 


(0 f| = ¥ p( Et). 
k=1 k=1 


Proposition 11 Let S be a collection of subsets of a set X and 1: S > [0, 00] a set function. 
In order that the Carathéodory measure induced by p be an extension of it is necessary that 
u. be both finitely additive and countably monotone and, if ® belongs to S, that 1(®) = 0. 


Proof Let (X,M, /) denote the Carathéodory measure space induced by wu and suppose 
z: M — [0, oo] extends uw: S — [0, oo]. First of all, observe that if @ belongs to S, then 
#(8) = (0) = Osince wis a measure that extends p. Now let {Ex},;_, be a disjoint collection 
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of sets in S such that U?_, Ex also belongs to S. A measure is finitely additive since it is 
countably additive and the empty-set has measure zero. Therefore, since i extends p, 


(0 B = (U a = s E( Ex) = S (Ex). 
kal kal kal kal 


Thus p is finitely additive. To establish countable monotonicity observe that u(£) = u*(E 
for all E € S if and only if y is countably monotone. Thus if jz extends p, w*(E) = p(E) 
p( E) for all E € S and hence yp is countably monotone. 


~~” 


This proposition suggests that it is useful to single out and name the following class of 
set functions. 


Definition Let S be a collection of subsets of a set X and jz: S — [0, 00] a set function. Then 
pis called a premeasure provided y is both finitely additive and countably monotone and, if 
0 belongs to S, then 1(%) = 0. 


Being a premeasure is a necessary but not sufficient condition for the Carathéodory 
measure induced by p to be an extension of pz (examine the premeasures defined in Problems 
25 and 26). However, if we impose on S finer set-theoretic structure, this necessary condition 
is also sufficient. 


Definition A collection S of subsets of X is said to be closed with respect to the formation 
of relative complements provided whenever A and B belong to S, the relative complement 
A~B belongs to S. The collection S is said to be closed with respect to the formation of finite 
intersections provided whenever A and B belong to S, the intersection AQ B belongs to S. 


Observe that if a collection of sets S is closed with respect to the formation of relative 
complements, then it is also closed with respect to the formation of finite intersections since 
if A and B belong to S so does 

ANB=A~[A~Bl. 


Also observe that if a nonempty collection of sets S is closed with respect to the formation 
of relative complements, then it contains J. Indeed, § = A~ A, where A belongs to S. 


Theorem 12 Let 1: S — [0, 00] be a premeasure on a nonempty collection S of subsets of 
X that is closed with respect to the formation of relative complements. Then the Carathéodory 
measure jt: M — [0, oo] induced by yp is an extension of w: it is called the Carathéodory 
extension of i. 


Proof Let A belong to S. To show that A is measurable with respect to the outer measure 
induced by y it suffices to let E be any subset of X of finite outer measure, let « > 0 and 
verify that 

p*(E)+e> p(ENA)+p(ENAS). (8) 


By the definition of outer measure, there is a collection {E;}7° , of sets in S that covers E and 


u(E) +e ¥ aE) (9) 
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However, for each k, since S is closed with respect to the formation of relative complements, 
E, ~ A belongs to S and so does Ex A = Ey ~[E,~ A]. A premeasure is finitely additive. 
Therefore 

W( Ex) = w( Ee A) + w( EN AS). 


Sum these inequalities to conclude that 
OO OO OO 
> ME) = Se ENA) + ¥ p( EN AS). (10) 
k=1 k=1 k=1 


Observe that {E, N A}, and {EN AC} are countable collections of sets in S that cover 
EN Aand EN A‘, respectively. Therefore, by the very definition of outer measure, 


OO 1°, 6) 
> M(ELNA) > w*(ENA) and S w(E,N AC) > p(EN A). 
k=1 k=1 


The desired inequality (8) follows from the these two inequalities together with (9) and (10). 


Clearly 4(E) = w*(E) for each set E € S if and only if 2 is countable monotone. 
Hence for each E € S, w(E) = w*(E) and therefore, since each set E € S is measurable, 
w(E) = p(B). U 


Remark Observe the quite distinct roles played by the two properties of a premeasure in the 
proof of the above theorem. We used the finite additivity of to infer that every set in S is 
p*-measurable. The countable monotonicity of wis equivalent to the equality u(E) = p*(E) 
forallE €S. 


A number of natural premeasures, including the premeasure length defined on the 
collection of bounded intervals of real numbers, are defined on collections of sets that are not 
closed with respect to the formation of relative complements. However, we now introduce 
the notion of a semiring. We show that a semiring S has the property that every premeasure 
on S has a unique extension to a premeasure on a collection of sets that is closed with 
respect to the formation of relative complements. This purely set-theoretic result, together 
with Theorem 12, will be used to show that premeasures on semirings are extended by their 
induced Carathéodory measure. 


Definition A nonempty collection S of subsets of a set X is called a semiring provided 
whenever A and B belong to S, then A( B also belongs to S and there is a finite disjoint 
collection {C,}"_, of sets in S for which 


A~B= U Cr. 
k=1 


Proposition 13 Let S be a semiring of subsets of a set X. Define S' to be the collection of 
unions of finite disjoint collections of sets in S. Then S' is closed with respect to the formation 
of relative complements. Furthermore, any premeasure on S has a unique extension to a 
premeasure on S’. 
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Proof It is clear that S’ is closed with respect to the formation of finite unions and finite 
intersections. Let {Ax};_, and {B ea be two finite disjoint collections of sets in S. Observe 
that 

Ua ~ 183i] =U [Nae] - (11) 
k=1 j=l 


k=1 | j=1 


Since each A; ~ B; belongs to S’ and S’ is closed with respect to the formation of finite unions 
and finite intersections, we infer from (11) that S’ is closed with respect to the formation of 
relative complements. 


Let x: S > [0, co] be a premeasure on S. For E C X such that E = Uj_ Ax, where 
n 
{Ax}%_, is a disjoint collection of sets in S, define p'(E) = & w( Ax). To verify that w(E) 
k=l 


is properly defined, let E also be the disjoint union of the finite collection {B nes of sets in 
S. We must show that 


S u(B;) = 3 w( Ar). 


j=l k=1 
However, by finite additivity of a premeasure, 


n 


w( Bj) = > w(BjO Ax) forl < j<m 


k=1 
and , 
w( Ag) = > w(BjM Ag) for 1 <k <n. 
j=l 
Therefore 


¥ u(B)) => » mB.) => a8) At) = ¥ u(r). 


j=l j=l 


Thus p’ is properly defined on S. 


It remains to show that p’ is a premeasure on S’. Since py’ is properly defined it 
inherits finite additivity from the finite additivity possessed by jz. To establish the countable 
monotonicity of yz’, let E € S’ be covered by the collection {E,}7°., of sets in S’. Without loss 
of generality we may assume that {E;,}, is a disjoint collection of sets in S (see part (iii) of 
Problem 31). Let E = U%_, Aj, where the union is disjoint and each A; belongs to S. For 
each j, A; is covered by U2. (Aj Ex), a countable collection of sets in S and therefore, by 
the countable monotonicity of p, 

OO 
(Aj) < 2 w(Ay 1 Ex). 
k= 


Thus, by the finite monotonicity of p, 
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OO 
< Ye (Ex). 
k=1 
Therefore 1’ is countably monotone. The proof is complete. O 


For S a collection of subsets of X, a set function w: S — [0, oo] is said to be o-finite 
provided X =, S; where for each k, S; € S and 1( Sx) < 00. 


The Carathéodory-Hahn Theorem = Let yu: S > [0, 00] be a premeasure on a semiring 
S of subsets of X. Then the Carathéodory measure ji induced by p is an extension of wp. 
Furthermore, if u is o-finite, then so is ji and pi is the unique measure on the o-algebra of 
p*-measurable sets that extends p. 


p: M — [0, oo] 
(the Carathéodory extension) 
A 


ae 
p*:2* —> [0, 00] 


| 

| 

| 

| (the induced outer measure) 
| 

| 

| 


pe: S > [0, oo] 
(a premeasure on a semiring S) 


The Carathéodory Construction Extends a Premeasure on a Semiring to a Measure 


Proof We infer from Theorem 12 and Proposition 13 that 7 extends jt. Now assume that pu 
is o-finite. To prove uniqueness, let 4; be another measure on M that extends p. We express 
X =U, Xx, where the union is disjoint and for each k, X; belongs to S and p(X;) < oo. 
By the countable additivity of a measure, to prove uniqueness it suffices to show that pi and 
4; agree on the measurable sets contained in each X,. Let E be measurable with E C Eo, 
where Eo € S and p( Eo) < 00. We will show that 


BE) = (BE). (12) 
According to Proposition 10, there is a set A € S,s for which E C A and H(A~ E) =0. We 
may assume that A C Eo. However, by the countable monotonicity of 1, if B is measurable 
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and y*(B) = 0, then 41(B) = 0. Therefore 4;(A~E) = 0. On the other hand, by the 
countable additivity of 4; and jz, these measures agree on S,, and therefore by the continuity 
of measure they agree of the subsets of Ey which belong to S,5. Therefore (A) = (A). 
Hence | 7 

yi(A~E) =—(A~E) and w(A) = (A), 


and so (12) is verified. © | | CL] 


Corollary 14 Let S be a semiring of subsets of a set X and B the smallest o-algebra of subsets 
of X that contains S. Then two o-finite measures on B are equal if and only if they agree on 
sets in S. 


The set-theoretic restrictions on the collection of sets S that are imposed in the 
Carathéodory-Hahn Theorem are satisfied in a number of important cases. For example, the 
collection of bounded intervals of real numbers and the collection of subsets of the plane 
R’ that are Cartesian products of bounded intervals of real numbers are semirings (see Pro- 
blem 33). Moreover, the collection of bounded intervals in R” is a semiring. This will permit 
us to construct Lebesgue measure on R” by use of the Carathéodory construction. 

We note that the uniqueness assertion in the Carathéodory -Hahn Theorem may fail if 
the premeasure is not assumed to be o-finite (see Problem 32). 

It is useful for the reader to be familiar with some of the vocabulary associated with 
properties of collections S of subsets of a set X. A collection S is called a ring of sets 
provided it is closed with respect to the formation of finite unions and relative complements 
and, therefore, with respect to the formation of finite intersections. A ring that contains X is 
called an algebra while a semiring that contains X is called a semialgebra. 


PROBLEMS 


25. Let X be any set containing more than one point and A a proper nonempty subset of X. 
Define S = {A, X} and the set function : S > [0, oo] by (A) = 1 and p(X) = 2. Show 


that 4: S — [0, oo] is a premeasure. Can pz be extended to a measure? What are the subsets 


of X that are measurable with respect to the outer measure j:* induced by 1? 
26. Consider the collection S = {9, (0, 1], 0, 3], [2, 3]} of subsets of R and define p(B) = 0, 
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w([0, 1]) = 1, w([0, 3]) = 1, w([2, 3]) = 1. Show that yu: S > [0, oo] is a premeasure. Can 


p. be extended to a measure? What are the subsets of R that are measurable with respect to 
the outer measure y* induced by 1? 


27. Let S be a collection of subsets of a set X and yw: S — [0, ov] aset t function. Show that eis 
countably monotone if and only if u* is an extension of pu. 


28. Show that a set function is a premeasure if it has an extension that is a measure. 
29. Show that a set function on a g-algebra is a measure if and only if it is a premeasure. 


30. Let S be a collection of sets that is closed with respect to the formation of finite unions and 
finite intersections. 
(i) Show that S, is closed with respect to the formation of countable unions and finite 
intersections. 


(ii) Show that each set in Ss is the intersection of a decreasing sequence of S, sets. 
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31. 


32. 


33. 


35. 


Let S be a semialgebra of subsets of a set X and S’ the collection of unions of finite disjoint 
collections of sets in S. 


(i) Show that S’ is an algebra. 
(ii) Show that S, = S’. and therefore S,5 = Si5- 


(iii) Let {E,}?°, be a collection of sets in S’. Show that we can express? , E;, as the disjoint 
union U7’, Ex of sets in S for which 


Sue) > S ul By). 


k=1 k=1 


(iv) Let A belong to S’,;. Show that A is the intersection of a descending sequence {An}, 
of sets in S,. 


Let Q be the set of rational numbers and S the collection of all finite unions of intervals of the 
form (a, b]N Q, where a, b € Q anda < b. Define (a, b] = 00 ifa < band p(B) = 0. Show 
that S is closed with respect to the formation of relative complements and yw: S > [0, oo] isa 
premeasure. Then show that the extension of y to the smallest o-algebra containing S is not 
unique. 

By a bounded interval of real numbers we mean a set of the form [a, b], [a, b),(a, b] or 
(a, b) for real numbers a < b. Thus we consider the empty-set and a set consisting of a single 
point to be a bounded interval. Show that each of the following three collections of sets S is 
a semiring. 

(i) Let S be the collection of all bounded intervals of real numbers. 


(i) Let S be the collection of all subsets of R X R that are products of bounded intervals of 
real numbers. 


(ii) Let n be a natural number and X be the n-fold Cartesian product of R: 


n times 
eee 
X=RxX.--XR. 


Let 5 be the collection of all subsets of X that are n-fold Cartesian products of bounded 
intervals of real numbers. 


. If we start with an outer measure y* on 2* and form the induced measure pon the 


w*-measurable sets, we can view @@ as a set function and denote by w* the outer measure 
induced by jy. 


(i) Show that for each set E C X we have u+(E) > p*(E). 

(ii) For a given set E, show that u*+(E) = u*(E) if and only if there is a u*-measurable set 
ADE with u*(A) = p*(E). 

Let S be a o-algebra of subsets of X and w: S — [0, oo] a measure. Let @: M — [0, oo] be 


the measure induced by yu via the Carathéodory construction. Show that S is a subcollection 
of M and it may be a proper subcollection. 


. Let «be a finite premeasure on an algebra S, and y* the induced outer measure. Show that a 


subset E of X is w*-measurable if and only if for each ¢ > 0 there is a set A € Ss, AC E, such 
that u*(E~ A) <e. 
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We begin the study of integration over general measure spaces by devoting the first section 
to the consideration of measurable functions. Much of this is quite similar to the study of 
Lebesgue measurable functions on a single real variable. Our approach to general integration 
differs from the one we pursued in Chapter 4 for integration with respect to Lebesgue mea- 
sure for functions of a real variable. In Section 2, we first define the integral for a nonnegative 
simple function and then directly define the integral of a nonnegative measurable function 
f as the supremum of integrals of nonnegative simple functions for which 0 <  < f. 
At this early stage we establish the general Fatou’s Lemma, which is the cornerstone of 
the full development of the integral, and its close relatives, the Monotone Convergence 
Theorem and Beppo Levi’s Lemma. In the third section, we consider integration for general 
measurable functions and establish the linearity and monotonicity properties of the integral, 
the continuity, and countable additivity of integration, and the Integral Comparison Test 
and Vitali Convergence Theorem. In Section 4, we introduce the concept of absolute con- 
tinuity of one measure with respect to another and prove the Radon-Nikodym Theorem, a 
far-reaching generalization of the representation of absolutely continuous functions of a real 
variable as indefinite integrals. We also establish the Lebesgue Decomposition Theorem 
for measures. The chapter concludes with an application of the Baire Category Theorem 
to prove the Vitali-Hahn—Nikodym Theorem, which tells us of very general assumptions 
under which the setwise limit of a sequence of measures is again a measure. 


18.1 MEASURABLE FUNCTIONS 


For a measurable space (X, M), the concept of a measurable function on X is identical 
with that for functions of a real variable with respect to Lebesgue measure. The proof of the 
following proposition is exactly the same as the proof for Lebesgue measure on the real line: 
see page 54. 


Proposition 1 Let (X, M) be a measurable space and f an extended real-valued function 
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defined on X. Then the following statements are equivalent: 


(t) For each real number c, the set {x € X | f(x) <c} is measurable. 
(ii) For each real number c, the set {x € X | f(x) < c} is measurable. 
(iii) For each real number c, the set {x € X | f(x) > c} is measurable. 
(iv) For each real number c, the set {x € X | f(x) > c} is measurable. 


Each of these properties implies that for each extended real number c, 


the set {x EX | f(x) =c} is measurable. 


Definition Let (X, M) be a measurable space. An extended real-valued function f on X is 
said to be measurable (or measurable with respect to M) provided one, and hence all, of the 
four statements of Proposition 1 holds. 


For a set X and the c-algebra M = 2% of all subsets of X, every extended real-valued 
function on X is measurable with respect to M. At the opposite extreme, consider the 
g-algebra M = {X, 6}, with respect to which the only measurable functions are those that 
are constant. If X is a topological space and M is a c-algebra of subsets of X that contains the 
topology on X, then every continuous real-valued function on X is measurable with respect 
to M. In Part 1 we studied functions of a real variable that are measurable with respect to 
the o-algebra of Lebesgue measurable sets. 

Since a bounded, open interval of real numbers is the intersection of two unbounded, 
open intervals and each open set of real numbers is the countable union of a collection of 
open intervals, we have the following characterizaton of real-valued measurable functions 
(see also Problem 1). 


Proposition 2 Let (X, M) be a measurable space and f a real-valued function on X. Then 
f is measurable if and only if for each open set O of real numbers, f~'(O) is measurable. 


For a measurable space (X, M) and measurable subset E of X, we call an extended 
real-valued function f that is defined on E measurable provided it is measurable with 
respect to the measurable space (E, Mz), where Mg is the collection of sets in M that 
are contained in E. The restriction of a measurable function on X to a measurable set is 
measurable. Moreover, for an extended real-valued function f of X and measurable subset 
E of X, the restriction of f to both E and X ~ E are measurable if and only if f is measurable 
on X. 


Proposition 3. Let (X, M, w) be a complete measure space and Xo a measurable subset of 
X for which p(X ~ Xo) = 0. Then an extended real-valued function f on X is measurable if 
and only if its restriction to Xq is measurable. In particular, if g and h are extended real-valued 
functions on X for which g = h a.e. on X, then g is measurable if and only if h is measurable. 


Proof Define fp to be the restriction of f to Xo. Let c be a real number and E = (c, oo). 
If f is measurable, then f~'(£) is measurable and hence so is f-!(E)M Xo = fo '(E). 
Therefore fp is measurable. Now assume fp is measurable. Then 


f\(E)= fy (E)UA, 
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where A is a subset of X ~ Xo. Since (X, M, mw) is complete, A is measurable and hence 
so is f—'(E). Therefore the function f is measurable. The second assertion follows from 
the first. Oo 


This proposition is false if the measure space (X, M, 2) fails to be complete (see Problem 2). 
The proof of the following theorem is exactly the same as the proof in the case of Lebesgue 
measure on the real line; see page 56. 


Theorem 4 Let(X, M) be ameasurable space and f and g measurable real-valued functions 
on X. 


(Linearity) For any real numbers a and B, 
af + Bg is measurable. 


(Products) 
f - gis measurable. 


(Maximum and Minimum) The functions max{ f, g} and min{ f, g} are measurable. 


Remark The sum of two extended real-valued functions is not defined at points where the 
functions take infinite values of opposite sign. Nevertheless, in the study of linear spaces of 
integrable functions it is necessary to consider linear combinations of extended real-valued 
measurable functions. For measurable functions that are finite almost everywhere, we proceed 
as we did for functions of a real variable. Indeed, for a measure space (X, M, u), consider 
two extended real-valued measurable functions f and g on X that are finite a.e. on X. Define 
Xq to be the set of points in X at which both f and g are finite. Since fand g are measurable 
functions, Xo is a measurable set. Moreover, p(X ~ Xo) = 0. For real numbers a and B, 
the linear combination a f + Bg is a properly defined real-valued function on Xo. We say 
that a f + Bg is measurable on X provided its restriction to Xo is measurable with respect to 
the measurable space (Xo, Mo), where Mo is the o-algebra consisting of all sets in M that 
are contained in Xq. If (X, M, m) is complete, Proposition 3 tells us that this definition is 
equivalent to the assertion that one, and hence any, extension of af + Bg on Xo to an extended 
real-valued function on all of X is a measurable function on X. We regard the function a f + Bg 
on X as being any measurable extended real-valued function on X that agrees with af + Bg on 
Xo. Similar considerations apply to the product of f and g and their maximum and minimum. 
With this convention, the preceding theorem holds if the extended real-valued measurable 
functions f and g are finite a.e. on X. 


We have already seen that the composition of Lebesgue measurable functions of a 
single real variable need not be measurable (see the example on page 58). However, the 
following composition criterion is very useful. It tells us, for instance, that if f is a measurable 
function and 0 < p < ov, then | f|? also is measurable. 


Proposition 5 Let (X, M) be a measurable space, f a measurable real-valued function on 
X, and ¢: R - Rcontinuous. Then the composition g o f : X —- Ralso is measurable. 
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Proof Let O be an open set of real numbers. Since ¢ is continuous, g~!() is open. Hence, 
by Proposition 2, f-'(g-!(O)) = (go f)~!(O) is a measurable set and so yo f is a 
measurable function. LI 


A fundamentally important property of measurable functions is that, just as in the 
special case of Lebesgue measurable functions of a real variable, measurability of functions 
is preserved under the formation of pointwise limits. 


Theorem 6 Let (X, M, 1) be a measure space and { f,} a sequence of measurable functions 
on X for which {f,} — f pointwise a.e. on X. If either the measure space (X, M, p) is 
complete or the convergence is pointwise on all of X, then f is measurable. 


Proof In view of Proposition 3, possibly by excising from X a set of measure 0, we 
suppose the sequence converges pointwise on all of X. Fix a real number c. We must 
show that the set {x € X| f(x) < c}is measurable. Observe that for a point x € X, since 
limyoo fn(x) = f(x), f(x) < c if and only if there are natural numbers n and k such that 
for all j > k, fj(x) <c —1/n. But for any natural numbers n and j, since the function f; is 
measurable, the set {x € X | f;(x) <c — 1/n} is measurable. Since M is closed with respect 
to the formation of countable intersections, for any k, 


oo 


(\{xeX| fi(x)<c-1/n} 


isk 


also is measurable. Consequently, 


oo 


{xe X | f(x)<ch= J (\{xeXx| fj(x) <c—1/n} 


1<k,n<oo | j=k 
is measurable since M is closed with respect to the formation of countable unions. LI 
This theorem is false if the measure space fails to be complete (see Problem 3). 


Corollary 7 Let (X, M, ) be a measure space and { f,} a sequence of measurable functions 
on X. Then the following functions are measurable: 


sup {f,}, inf {f,}, limsup{f,}, liminf{/,}. 


Definition Let (X, M) be a measurable space. For a measurable set E, its characteristic 
function, yz, is the function on X that takes the value 1 on E and 0 on X ~ E. A real-valued 
function on X is said to be simple provided there is a finite collection {Ex},_, of measurable 
sets and a corresponding set of real numbers {cx},_, for which 


n 
w= >) ck XE, on X. 
k=1 


Observe that a simple function on X is a measurable real-valued function on X that 
takes a finite number of real values. 
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The Simple Approximation Lemma Let (X, M) be a measurable space and f a measurable 
function on X that is bounded on X, that is, there is an M > 0 for which | f| < M on X. Then 
for each € > 0, there are simple functions p, and , defined on X that have the following 
approximation properties: 


Oe <f<wpeand0<.-o <e€onX. 
Proof Let [c, d) be a bounded interval that contains the image of X, f(X), and 
C= yo<y<...-<Yn-1 <n =a 
a partition of the closed, bounded interval [c, d] such that y, — y,_1 < «for 1 < k <n. Define 
I, = [ye-1, ye) and X, = f-'(K) for1 <k <n. 


Since each J; 1s an interval and the function f is measurable, each set X; is measurable. 
Define the simple functions », and y, on X by 
n n 
ge = >) Ye-1- Xx, and We = DS) ye XX. 
= k=1 


k=l 
Let x belong to X. Since f(X) C [c, d), there is a unique k,1 < k <n, for which 
Ye-1 < f(x) < y, and therefore 


Pe(xX) = ye_-1 < f(x) < ye = e(X). 


But yy — ye-1 < €, and therefore g, and y, have the required approximation properties. L 


The Simple Approximation Theorem Let(X,M, w) beameasure space and f ameasurable 
function on X. Then there is a sequence {y,} of simple functions on X that converges pointwise 
on X to f and has the property that 


lWnl < |flon X forall n. 


(i) If X is o-finite, then we may choose the sequence {ifn} so that each i, vanishes outside 
a set of finite measure. 


(it) If f isnonnegative, we may choose the sequence {jn} to be increasing and each fr, > 0 
on X. 


Proof Fix a natural number n. Define E, = {x € X||f(x)| < n}. Since | f| is a measurable 
function, E, is a measurable set and the restriction of f to E, is a bounded measurable 
function. By the Simple Approximation Lemma, applied to the restriction of f to E, and 
with the choice of € = 1/n, we may select simple functions h, and g, on E,, which have the 
following approximation properties: 


hn < f < 8, and0 < g, —h, <1/non Ey. 


For xin E,, define ¥,(x) = 0 if f(x) = 0, wa(x) = max{h,(x),0} if f(x) > 0 and 
Un(x) = min{g,(x), 0} if f(x) < 0. Extend y,, to all of X by setting y,(x) =nif f(x) >n 
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and W(x) = —n if f(x) < —n. This defines a sequence {y,,} of simple functions on X. It 
follows, as it did in the proof for the case of Lebesgue measurable functions of a real variable 
(see page 62), that, for each n, |Wn| < |f| on X and the sequence {y,,} converges pointwise 
on X to f. 


If X is o-finite, express X as the union of a countable ascending collection {Xn}, of 
measurable subsets, each of which has finite measure. Replace each yw, by Wy - yx, and (i) is 
verified. If f is nonnegative, replace each wy, by max; <j<n |W;| and (ii) is verified. a 


The proof of the following general form of Egoroff’s Theorem follows from the 
continuity and countable additivity of measure, as did the proof in the case of Lebesgue 
measurable functions of a real variable; see page 65. 


Egoroffs Theorem Let (X, M, w) be a finite measure space and {fy} a sequence of 
measurable functions on X that converges pointwise a.e. on X to a function f that is finite a.e. 
on X. Then for each € > 0, there is a measurable subset X, of X for which 


{fn} —> f uniformly on X, and p(X ~ X,) <e. 


PROBLEMS 


In the following problems (X, M, ) is a reference measure space and measurable means 
with respect to M. 
1. Show that an extended real-valued function on X is measurable if and only if f~!{oo} and 
f~!{—0o} are measurable and so is f—!( E) for every Borel set of real numbers. 


2. Suppose (X, M, w) is not complete. Let E be a subset of a set of measure zero that does not 
belong to M. Let f = 0 on X and g = yz. Show that f = ga.e. on X while f is measurable 
and g is not. 


3. Suppose (X, M, w) is not complete. Show that there is a sequence {f,} of measurable 
functions on X that converges pointwise a.e. on X to a function f that is not measurable. 


4. Let E be a measurable subset of X and f an extended real-valued function on X. Show that 
f is measurable if and only if its restrictions to E and X ~ E are measurable. 


5. Show that an extended real-valued function f on X is measurable if and only if for each 
rational number c, {x € X | f(x) <c} is a measurable set. 


6. Consider two extended real-valued measurable functions f and g on X that are finite a.e. on 
X. Define Xo to be the set of points in X at which both f and g are finite. Show that Xo is 
measurable and p(X ~ Xo) = 0. 


7. Let X be anonempty set. Show that every extended real-valued function on X is measurable 
with respect to the measurable space (X, 2*). 


(i) Let xp belong to X and 6,, be the Dirac measure at x9 on 2*. Show that two functions on 
X are equal a.e. [5,,| if and only if they take the same value at xo. 


(ii) Let 7 be the counting measure on 2*. Show that two functions on X are equal a.e. [7] if 
and only if they take the same value at every point in X. 


8. Let X be a topological space and B(X) the smallest o-algebra containing the topology on X. 
B(X) is called the Borel o-algebra associated with the topological space X. Show that any 
continuous real-valued function on X is measurable with respect to the Borel measurable 
space {X, B(X)). 7 
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9. If a real-valued function on R is measurable with respect to the o-algebra of Lebesgue 
measurable sets, is it necessarily measurable with respect to the Borel measurable space 
(R, B(R))? 

10. Check that the proofs of Proposition 1 and Theorem 4 follow from the proofs of the 
corresponding results in the case of Lebesgue measure on the real line. 


11. Complete the proof of the Simple Approximation Lemma. 


12. Prove Egoroff’s Theorem. Is Egoroff’s Theorem true in the absence of the assumption that 
the limit function is finite a.e.? 


13. Let {f,} be a sequence of real-valued measurable functions on X such that, for each natural 
number n, p{x € X||fn(x) — fngi(x)| > 1/2"}1/2". Show that {f,,} is pointwise convergent 
a.e. on X. (Hint: Use the Borel-Cantelli Lemma.) 

14. Under the assumptions of Egoroff’s Theorem, show that X = Urey Xx, Where each X; is 
measurable, (Xo) = 0 and, for k > 1, {f,} converges uniformly to f on Xx. 


15. A sequence (f,) of measurable real-valued functions on X is said to converge in measure to 
a measurable function f provided that for each yn > 0, 


Jim w{x€X| |fa(x) — f(*)|>n} =0. 


A sequence (f,) of measurable functions is said to be Cauchy in measure provided that for 
each « > 0 and 7 > 0, there is an index N such that for each m,n > N, 


mixeX| |falx) — fin(x)| >n}<e. 


(i) Show that if 4(X) < oo and { f,} converges pointwise a.e. on X to a measurable function 
f, then { f,} converges to f in measure. (Hint: Use Egoroff’s Theorem.) 


(ii) Show that if {f,} converges to f in measure, then there is a subsequence of { f,} that 
converges pointwise a.e. on X to f. (Hint: Use the Borel-Cantelli Lemma.) 


(iii) Show that if {f,,} is Cauchy in measure, then there is a measurable function f to which 
{fn} converges in measure. 


16. Assume p(X) < oo. Show that {f,} > f in measure if and only if each subsequence of { fy} 
has a further subsequence that converges pointwise a.e. on X to f. Use this to show that for 
two sequences that converge in measure, the product sequence also converges in measure to 
the product of the limits. 


18.2 INTEGRATION OF NONNEGATIVE MEASURABLE FUNCTIONS 


In Chapter 4 we developed integration for Lebesgue measurable functions of a real variable 
with respect to Lebesgue measure. We first defined the integral of a simple function 
over a set of finite Lebesgue measure. The second step was to define the concepts of 
integrability and integral for a bounded function on a set of finite measure and use the 
Simple Approximation Lemma to show that a bounded measurable function that vanished 
outside a set of finite Lebesgue measure is integrable and that the integral of such functions 
possessed the anticipated linearity, monotonicity, and additivity over domains properties. 
We then defined the Lebesgue integral of a nonnegative Lebesgue measurable function f 
over an arbitrary Lebesgue measurable set E to be the supremum of { 7& as granged over all - 
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bounded Lebesgue measurable functions g for which 0 < g < f on E and that vanish outside 
a set of finite Lebesgue measure. This approach is not appropriate in the case of a general 
measure space. Indeed, for a measure space (X, M, yw), if u(X) = oo, we certainly want 
fy 1du = oo. However, if X isnonempty, M = {X, 6} and the measure p is defined by setting 
#(G) = 0 and p(X) = oo, then the only measurable function g that vanishes outside of a set 
of finite measure is g=0, and hence the supremum of { y & dp over such functions is zero. To 
circumvent this difficulty, for the general integral, we first define the integral of nonnegative 
simple functions and then define the integral of a nonnegative measurable function directly 
in terms of integrals of nonnegative simple functions. We almost immediately establish a 
general version of Fatou’s Lemma and make this the cornerstone of further development. 
We devote this section to integration of nonnegative measurable functions. 


Definition Let (X,M, w) be a measure space and f a nonnegative simple function on X. 
Define the integral of over X, Jy wdp, as follows: if p = 0 on X, define f, du = 0. 
Otherwise, let c,, C2, ..., Cn be the positive values taken by on X and, for 1 < k <n, define 
Ey = {x € X|b(x) = cy}. Define 


[ van = 3 cx-u(Ey), (1) 
X k=1 


using the convention that the right-hand side is 00 if, for some k, u( Ex.) = 00. For a measurable 
subset E of X, the integral of over E with respect to pw is defined to be { y¥: Xedp and 
denoted by {, f du. 


Proposition 8 Let (X,M, uw) be a measure space and ¢ and nonnegative simple function 
on X. If a and B are positive real numbers, then 


[tav+B-eldu=a-f vdu+B- [ody (2) 
Xx X X 
If A and B are disjoint measurable subsets of X, then 
[vans f vdus f ya. (3) 
AUB A B 
In particular, if Xo C X is measurable and p(X ~ Xo) = 0, then 
| vdu=[ wap. (4) 
Xx Xo 


Furthermore, if hb < ¢ a.e. on X, then 


[vans | ody (5) 


Proof If either y or ¢ 1s positive on a set of infinite measure, then the linear combination 
a--+B-@ has the same property and therefore each side of (2) is infinite. We therefore 
assume both w and ¢ vanish outside a set of finite measure and hence so does the linear 
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combination a - y+ B-¢. In this case the proof of (2) is exactly the same as the proof 
for Lebesgue integration of functions of a real variable (see the proofs of Lemma 1 and 
Proposition 2 on page 72). The additivity over domains formula follows from (2) and the 
observation that, since A and B are disjoint, 


W-XauB=W-Xat+W-ypon X. 


To verify (5), first observe that since the integral of a simple function over a set of measure 
zero is zero, by (3), we may assume y < ¢ on X. Observe that since ¢ and y take only a finite 
number of real values, we may express X as U7 _, Xx, a disjoint union of measurable sets for 
which both 9 and yf are constant on each X;,. Therefore 


n n 
y= SD) dk XX; and ¢ = SD bk XX: where a, <b, forl <k <n. (6) 
k-1 k=1 


But (2) extends to finite linear combinations of nonnegative simple functions and therefore 
(5) follows from (6). a 


Definition Let (X,M, 41) be a measure space and f a nonnegative extended real-valued 
measurable function on X. The integral of f over X with respect to y, which is denoted by 
f x J dp, is defined to be the supremum of the integrals [ x dp as ranges over all simple 
functions » for which 0 < g < f on X. For a measurable subset E of X, the integral of f over 
E with respect to p is defined to be fy f - xg dwand denoted by J, f du. 


We leave it as an exercise to verify the following three properties of the integral of 
nonnegative measurable functions. Let (X,.M, 1) be a measure space, g and h nonnegative 
measurable functions on X, Xp a measurable subset of X, and a a positive real number. Then 


[aeduaa- | edu; (7) 
xX xX 
if g<hae.on Xx, then f edu < [hd (8) 
xX xX 
[ sau=f sdwitu(x~ x0) =0 (9) 
X Xo 


Chebychev’s Inequality Let (X,.M, «) be a measure space, f a nonnegative measurable 
function on X, and X a positive real number. Then 


w{rex| Mx)zahs5f fap (10) 


Proof Define X, = {x € X| f(x) => A} and @ =A- yx,. Observe that0 < o < fon X and 
is a simple function. Therefore, by definition, 


awa) =f edu ff tau 


Divide this inequality by A to obtain Chebychev’s Inequality. O 
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Proposition 9 Let (X,.M, yw) be a measure space and f a nonnegative measurable function 
on X for which J, f du < 00. Then f is finite a.e. on X and {x € X| f(x) > 0} is o-finite. 


Proof Define X.. = {x € X| f(x) = oo} and consider the simple function » = yy... By 
definition, /, du = (Xo) and since 0 < w < f onX, (Xoo) < fy f dw < ov. Therefore 
f is finite a.e. on X. Let n be a natural number. Define X, = {x € X| f(x) = 1/n}. By 
Chebychev’s Inequality, 


w(Xn) <n. | f dp < ov. 
X 
Moreover, 
{xe X | f(x) >0$=VUE, 
n=l 
Therefore the set {x € X | f(x) > 0} is o-finite. LJ 


Fatou’s Lemma Let (X,M, w) be a measure space and {f,} a sequence of nonnegative 
measurable functions on X for which {f,} > f pointwise a.e. on X. Assume f is measurable. 
Then 


| fdp< mint f fn dp. (11) 
X X 


Proof Let Xv be a measurable subset of X for which u(X ~ Xy) = Oand { f,} > f pointwise 
on Xo. According to (9), each side of (11) remains unchanged if X is replaced by Xp. We 
therefore assume X = Xo. By the definition of f y J dp as a supremum, to verify (11) it is 
necessary and sufficient to show that if g is any simple function for which 0 < g < f on X, 
then 


| gdp < mint f fn dp. (12) 
x x 


Let ¢ be such a function. This inequality clearly holds if f xy¢dp =0. Assume fy pdu > 0. 


Case 1: [ x ¢ dp = oo. Then there is a measurable set X.. C X and a>0 for which p(X.) = 00 
and ¢ = a on Xq. For each natural number n, define 


An ={xeX| fi(x) = a/2 for allk > n}. 


Then {A,}°°., is an ascending sequence of measurable subsets of X. Since Xoo C Ue An, 
by the continuity and monotonicity of measure, 


e,¢) 
him #( An) = w((U An) > H(Xoo) = 00. 
n=1 
However, by Chebychev’s Inequality, for each natural number n, 
2 2 
LAr) < | fndp < “| fn dp. 
a An aJjy 


Therefore lim, +00 fy fndp = 00 = fy pdp. 
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Case 2:0 < f y ¢dpu < 00. By excising from X the set where ¢ takes the value 0, the left-hand 
side of (12) remains unchanged and the right-hand side does not increase. Thus we may 
suppose that » > 0 on X and therefore, since ¢ is simple and [, pdw < 00, u(X) < 00. To 
verify (12), choose € > 0. For each natural number n, define 


Xn, = {xe X | fir(x) > (1 -€) g(x) for allk >n}. 


Then {X,} is an ascending sequence of measurable subsets of X whose union equals X. 
Therefore {X ~ X,,} is a descending sequence of measurable subsets of X whose intersection 
is empty. Since u(X) < oo, by the continuity of measure, limy—o u(X ~ Xn) = 0. Choose 
an index N such that w(X ~ X,) <e for all n > N. Define M > 0 to be the maximum of 
the finite number of values taken by g on X. We infer from the monotonicity and positive 
homogeneity properties, (8) and (7), of integration for nonnegative measurable functions, 
the additivity over domains and monotonicity properties, (3) and (5), of integration for 
nonnegative simple function and the finiteness of [ y ¢ap that, forn > N, 


[ frdue f fn dp > (1-€) fy gdp 
X Xn 


=(1-6) f edu-(1-6) odp 
X X~Xn 


= (1-6) f edu | odp 
X X~Xy 


> (1-6) [ edu-em 

xX 

=| edu -< 
xX 


mint f faux | edu 
X X 


| edu+m 
X 


Hence 


| edu+m 
X 


This inequality holds for all « > 0 and hence, since {, pdu + M is finite, it also holds for 
e=0. UI 


In Fatou’s Lemma, the limit function f is assumed to be measurable. In case { f,,} 
converges pointwise to f on all of X or the measure space is complete, Theorem 6 tells us 
that f is measurable. 

We have already seen in the case of Lebesgue integration on the real line that the 
inequality (11) may be strict. For instance, it is strict for Lebesgue measure on X = [0, 1] and 
fn =~ X{0,1/n] for all n. It is also strict for Lebesgue measure on X = R and fy = X{n,n+1] 
for all n. However, for a sequence of measurable functions { f,} that converges pointwise on 
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X to f, in the case of Lebesgue integration for functions of a real variable, we established a 
number of criteria for justifying passage of the limit under the integral sign, that is, 


lim | [ frdp| = [ jim ho du. 


Each of these criteria has a correspondent in the general theory of integration. We first 
establish a general version of the Monotone Convergence Theorem. 


The Monotone Convergence Theorem Let (X,.M, 1) be a measure space and { f,} an 
increasing sequence of nonnegative measurable functions on X. Define f(x) = limn—+o fn(X) 


for each x € X. Then 
lim | frdu= | f dp. 
nO X X 


Proof Theorem 6 tells us that f is measurable. According to Fatou’s Lemma, 


[ fan stimint [frau. 
x x 
However, for eachn, f, < f on X, and so, by (8), fy frdu < fy f dp. Thus 

limsup | fr dp < | f dp. 

x x 
Hence 
| fau= tim | ada LJ 
X noo X 


Beppo Levi’s Lemma Let (X, M, ) be a measure space and { fy} an increasing sequence 
of nonnegative measurable functions on X. If the sequence of integrals { ly fn du} is bounded, 
then { f,} converges pointwise on X to a measurable function f that is finite a.e. on X and 


tim | frdu= [i fap <co 
noo xX xX 


Proof Define f(x) = limy—.oo f(x) for each x € X. The Monotone Convergence Theorem 
tells us that { tytn du} — fy f dw. Therefore, since the sequence of real numbers { tytn du} 


is bounded, its limit is finite and so [ y f du < ov. It follows from Proposition 9 that f is finite 
a.e.on X. UW 


Proposition 10 Let (X,M, w) be a measure space and f a nonnegative measurable function 
on X. Then there is an increasing sequence {is,} of simple functions on X that converges 


pointwise on X to f and 
lim | Wn au= | f dp. (13) 
noo X X 


Proof Apply the Simple Approximation Theorem and the Monotone Convergence 
Theorem. L 


< 
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Proposition 11 Let (X,M, 4.) be a measure space and f and g nonnegative measurable 
functions on X. If a and B are positive real numbers, then! 


[ie t+8-aliu=a- f fdn+e- | eau. (14) 


Proof In view of (7), it suffices to establish (14) for a = B = 1. According to the preceding 
theorem, there are increasing sequences {,,} and {g,,} of nonnegative simple functions on X 
that converge pointwise on X to g and f, respectively, 


tim, f vndu = f edu and jim f ondu= fi fay. 
Then {¢, + #,} is an increasing sequence of simple functions that converges pointwise on 


X to f +g. By the linearity of integration for nonnegative simple functions, the linearity of 
convergence for sequences of real numbers and the Monotone Convergence Theorem, 


| [f+eldu =lim,.. | [On + Wn] du 
»¢ X 


= timce|f ond t | indi 
X X 


= limy sco | gndu+ lim | Wn dp 
X nO JX 


= faut | gdp 
[fae oe O 


We have defined the integral of a nonnegative measurable function but so far not 
defined what it means for such a function to be integrable. 


Definition Let (X,M, «) be a measure space and f a nonnegative measurable function on 
X. Then f is said be integrable over X with respect to 4 provided J, f du < 0. 


The preceding proposition tells us that the sum of nonnegative integrable functions is 
integrable while Proposition 9 tells us that a nonnegative integrable function is finite a.e.and 
vanishes outside a o-finite set. 


PROBLEMS 


In the following problems, (X, M, 2) is a measure space, measurable means with respect 
to M, and integrable means with respect to p. 


17. Prove (7) and (8). Use (8) to prove (9). 


‘Since a and B are positive and f and g are nonnegative extended real-valued functions, a f + Bg is an extended 
real-valued function that is properly defined pointwise on all of X. 
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18. 


19. 


24. 


Let {u,} be a sequence of nonnegative measurable functions on X. For x € X, define 
io@) 

f(x) = 3 uy(x). Show that 
n=1 


fau=> J un dy 
X n=] {7X 


Show that if f is a nonnegative measurable function on X, then 


| f4u=0it and onlyit f = 0ae.on x 
X 


. Verify (2) in the case y and ¢ vanishes outside a set of finite measure. 


. Let f and g be nonnegative measurable functions on X for which g < f a.e. on X. Show that 


f =gae. on X if and only if fy gdu = J, f du. 


. Suppose f and g are nonnegative measurable functions on X for which f2 and g2 are 


integrable over X with respect to 1. Show that f - g also is integrable over X with respect to pw. 


. Let X be the union of a countable ascending sequence of measurable sets {X,} and fa 


nonnegative measurable function on X. Show that f is integrable over X if and only if there 
is an M > 0 for which ipa f dp <M forall n. 


Show that the definition of the integral of a nonnegative measurable function on a general 
measure space is consistent with the definition given in the particular case of the Lebesgue 
integral of a function of a real variable. 


. Let 7 be the counting measure on the natural numbers N. Characterize the nonnegative 


real-valued functions (that is, sequences) that are integrable over N with respect to 7 and the 
value of fy, f dn. 


. Let xo be a point in a set X and 6,, the Dirac measure concentrated at xp. Characterize the 


nonnegative real-valued functions on X that are integrable over X with respect to 6 xq and the 
value of {, f d8,). 


18.3. INTEGRATION OF GENERAL MEASURABLE FUNCTIONS 


Let (X, M) be a measurable space and f a measurable function on X. The positive part 
and the negative part of f, f+ and f~, are defined by 


f* =max{f, 0} and f— = max{—f, 0} on X. 


Both f* and f~ are nonnegative measurable functions on X for which 


f = ft —f- and|f| = ft + f7 on X. 


Since 0 < ft <|f| and 0 < f~ <|f| on X, we infer from (8) that if | f| is integrable 


over X,so are f* and f~ Conversely, by linearity of integration for nonnegative functions, 
if ft and f~ are integrable over X, so is | f]. 
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Definition Let (X, M, 1) be a measure space. A measurable function f on X is said to be 
integrable over X with respect to u provided | f| is integrable over X with respect to . For 
such a function, we define the integral of f over X with respect to p by 


[ tau= | rran-f fan 


For a measurable subset E of X, f is said to be integrable over E provided f - x is integrable 
over X with respect to .. The integral of f over E with respect to p is defined to be f[, f -Xedp 
and denoted by |, f du. 


The Integral Comparison Test Let (X, M, 4) be a measure space and f a measurable 
function on X. If g is integrable over X and dominates f on X inthe sense that | f| < g a.e. on X, 
then f is integrable over X and 


jf ta 


Proof The inequality (8) tells us that | f| is integrable over X. We invoke Proposition 11 and 
the inequality (8) once more to conclude that 


[ taul=|[ pean [au sf rdus | pau= ff isidus | edn 


LI 
Remark Let (X,M, w) be a measure space and f be integrable over X. We infer from 
Proposition 9, when applied to the positive and negative parts of f , that f is finite a.e. on X. 
Therefore, if g and h are integrable over X, the sum g +h is defined on X by the convention 
established in the remark on page 361. Furthermore, by (9), applied to the positive and negative 
parts of g +h, if Xo is the set of points in X at which both g and h are finite, then 


< [ ifldu < [ dp. (15) 


[its+mdn= [e+ Map 


Therefore the integral of h + g over X is properly defined, that is, it does not depend on the 
choice of functional value assigned to h + g at those points in X at which h and g take infinite 
values of opposite sign. 


Theorem 12 Let (X, M, yw) be a measure space and f and g be integrable over X. 


(Linearity) For real numbers a and B, af + Bg is integrable over X and 


[lar+eeldu=af rau+6] edu. 


(Monotonicity) If f < g a.e. on X, then 


[ fas | edu. 
JX X 
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(Additivity Over Domains) If A and B are disjoint measurable subsets of X, then 


[teem fi tau [pau 


Proof We prove linearity for coefficients a = B = 1 and leave the extension to the case 
of general coefficients as an exercise. Both | f| and | g| are integrable over X. According to 
Proposition 11, the sum | f| + |g| also is integrable over X. Since lf+el <|fl+lgl on X, we 
infer from (8) that | f + g| is integrable over X. Therefore the positive and negative parts of 
f,gand f + g are integrable over X. According to Proposition 9, by excising from X a set of 
measure Zero and using (9), we may assume that f and g are finite on X. To verify linearity 


is to show that 
[Urtet'du- firterdu= J pdu-f rdul+| fede | an) (16) 
X X X X X X 
But 
(f+8)"—(f+s) =f+e=(ft-f-)+(gt—97) onX, 
and therefore, since each of these six functions takes real values on X , 


(fts) +f +3 =(fts) t+ ft tet on. 
We infer from Proposition 11 that 
[trteyraus | fo dp+ | edu= | (f+ey dus | pans f er dy 
X X X X X X 
Since f, g and f+ are integrable over X, each of these six integrals is finite. Rearrange these 


integrals to obtain (16). We have established the linearity of integration. The monotonicity 
property follows from linearity since if f < ga.e. on X, then g— f > 0a.e. on X and therefore 


os e-fdu=f edu fay 


Additivity over domains follows from linearity and the observation that, since A and B are 
disjoint, 


+ 


f-Xaup=f-Xatf-xgonX. U 


As we have seen in the case of Lebesgue integration for functions of a real variable, 
the product of integrable functions is not, in general, integrable. In the following chapter 
we establish a general Hélder’s Inequality and thereby describe integrability properties of 
products of functions. 


Theorem 13 (the Countable Additivity Over Domains of Integration) Let (X, M, pw) bea 
measure space, the function f be integrable over X, and {X,, }re_, 4 disjoint countable collection 
of measurable sets whose union is X. Then 


> [saw (17) 
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Proof We assume f > 0. The general case follows by considering the positive and negative 
parts of f. For each natural number n, define 


n 
Sn = S f -Xx, on X. 
k=1 


The summation formula (17) now follows from the Monotone Convergence Theorem and 
the linearity of integration. U 


For a nonnegative integrable function g on X, this theorem tells us that the set function 
E-> J, gdu defines a finite measure on M? and hence has the continuity properties possessed 
by measures. This observation, applied to the positive and negative parts of an integrable 
function, provides the proof of the following theorem. 


Theorem 14 (the Continuity of Integration) Let (X, M, w) be a measure space and the 
function f be integrable over X. 


(i) If {Xp}°°., is an ascending countable collection of measurable subsets of X whose 


union is X, then 
| au= im, | f dp. (18) 
x no Jy 


(it) If {Xp}%-, is a descending countable collection of measurable subsets of X, then 


bev 


n=] “*n 


fdp= lim i] f dw. (19) 
noo Xn 


So far the only class of integrable functions we have are simple functions that vanish 
outside a set of finite measure. The following theorem presents a much larger linear space of 
integrable functions. 


Theorem 15 Let (X,M, uw) be a measure space and f a measurable function on X. If f is 
bounded on X and vanishes outside a set of finite measure, then f is integrable over X. 


Proof We assume f > 0 on X. The general case follows by considering the positive and 
negative parts of f. Let Xo be a set of finite measure for which f vanishes on X ~ Xp. Choose 
M > Osuch that 0 < f < M on X. Define g = M- yy,. Then0 < f < gon X. We infer from 
(8) that 


[ fans | edu =M-u(X0) <oo. O 
xX xX 


Corollary 16 Let X be a compact topological space and M a o-algebra of subsets of X that 
contains the topology on X. If f is a continuous real-valued function on X and (X , M, ») is 
a finite measure space, then f is integrable over X with respect to m. 


Proof Since f is continuous, for each open set O of real numbers, f~!(O) is open in X and 
therefore belongs to M. Thus f is measurable. On the other hand, since X is compact, f is 


2The integral over the empty-set is defined to be zero. 
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bounded. By assumption, u(X) < oo. The preceding theorem tells us that f is integrable 
over X with respect to p. 


We now return to the task of establishing criteria that justify, for a sequence of 
integrable functions that converges pointwise to a limit function, passage of the limit under 
the integral sign. 


The Lebesgue Dominated Convergence Theorem Let (X, M, 1) be a measure space and 
{ fn} a sequence of measurable functions on X for which {f,,} > f pointwise a.e. on X and the 
function f is measurable. Assume there is a nonnegative function g that is integrable over X 
and dominates the sequence { f,} on X in the sense that 


l ful < ga.e.on X forall n. 


Then f is integrable over X and 


NO X xX 


Proof For each natural number n, the nonnegative functions g— f,, and g+ f, are measurable. 
By the integral comparison test, for each n, f and f,, are integrable over X. Apply Fatou’s 
Lemma and the linearity of integration to the two sequences of nonnegative measurable 
functions {g — f,} and {g + f,} in order to conclude that 


[sam [tau = f le- slaw stimint [te foldn= f edu —timsup f fea: 


| sau | fdw= f e+ Ady stimint [ fg + faldu= fi gdu-+timint f fn dp; 
x x x x x x 
Therefore 


Himsup [ tn du < | f dy. < timing | fn Ap. 0 
X X X 


We established the Vitali Convergence Theorem for the Lebesgue integral of a function 
of a single real variable, first for integrals over sets of finite Lebesgue measure (see page 94) 
and then for integrals over sets of infinite Lebesgue measure (see page 98). We now establish 
a slight variation of this theorem for general integrals. 


Definition Let (X, M, 1) be a measure space and { f,} a sequence of functions on X, each 
of which is integrable over X. The sequence { f,} is said to be uniformly integrable over X 
provided for each € > 0, there is a 5 > 0 such that for any natural number n and measurable 
subset E of X, 


if u(E) <6, then | \faldu<e (20) 


The sequence { f,} is said to be tight over X provided for each « > 0, there is a subset Xo of X 
that has finite measure and, for any natural number n, 


| aldu <e. 
X~ Xo 
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Proposition 17 Let (X,M, i) be a measure space and the function f be integrable over X. 
Then for each € > 0, there is a 6 > 0 such that for any measurable subset E of X, 


if u( E) <6, then | \fldu<e (21) 


Furthermore, for each € > 0, there is a subset Xo of X that has finite measure and 
| lfldu<e. (22) 
X~ Xo 


Proof We assume f > 0 on X. The general case follows by considering the positive and 
negative parts of f. Let « >,0. Since f y f du is finite, by the definition of the integral of a 
nonnegative function, there is a simple function % on X for which 


0<wsfonxandd< | fdu- | va <e)2. 
X X 


Choose M > 0 such that 0 < # < M on X. Therefore, by the linearity and monotonicity of 
integration, if E € X is measurable, then 


[ teu | vans [Or-Wews f vay te/2<M-m(E) +6/2 


Thus (21) holds for 6 = €/2M. Since the simple function is integrable over X, the 
measurable set Xo = {x € X | h(x) > 0} has finite measure. Moreover, 


[fame] U-wlaw s | adn <e 


The proof is complete. O 


The Vitali Convergence Theorem Let (X, M, 1) be ameasure space and { f,} a sequence of 
functions on X that is both uniformly integrable and tight over X. Assume { f,} > f pointwise 
a.e. on X and the function f is integrable over X. Then 


lim | f,duw= i] f dp. 

n> & E E 
Proof Observe that for all n, | f— f,,| < | f|+|f,| on X. Therefore, by the integral comparison 
test and additivity over domains and monotonicity properties of integration, if Xo and X, 


are measurable subsets of X for which X; C Xo, then for all n, since X is the disjoint union 
X = X, U[Xo~ X,] U[X~ Xo], 


[i — f\dp 


<| I fn ~sdut falda J Wal + isle (23) 


Let «> 0. By the preceding proposition, the tightness of { f,,}, and the linearity of integration, 
there is a measurable subset Xo of X of finite measure for which 


[ ttifildn= [ald | fldu<é/3foralln. (24) 
X~ XQ X~ Xo X~ Xo 
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By the preceding proposition, the uniform integrability of { f,}, and the linearity of integra- 
tion, there is a 6 > 0 such that for any measurable subset E of X, 


if w(E) <8, then [ (al + Llldw =f Ufaldu + [ [fldw<é/3foralln. (25) 
E 


By assumption, f is integrable over X. Therefore f is finite a.e. on X. Moreover, u( Xo) <0. 
We may therefore apply Egoroff’s Theorem to infer that there is a measurable subset X, of 
Xo for which pu( Xo ~ X;) <6 and { f,,} converge uniformly on X; to f. It follows from (25) that 


| [fal + [fl] du < €/3 for all n. (26) 
Xo~X1 


On the other hand, by the uniform convergence of { f,} to f on Xj, a set of finite measure, 
there is an N for which 


| fn — fldu < sup | fn(x) — f(x) -w(X1) < €/3 for alln > N. (27) 
X41 xEX] 

From the inequality (23), together with the three estimates (24), (26), and (27), we conclude 
that 


[tna du <eforalln > N. 
X 


The proof is complete. UL 


The Vitali Convergence Theorem for general measure spaces differs from the special 
case of Lebesgue measure on the real line. In the general case, we need to assume that 
the limit function f is integrable over E. The integrability of f does not follow from the 
uniform integrability and tightness of {f,,} as it does in the case of Lebesgue integration 
on the real line (see, however, Problems 36 and 37). Indeed, let X be a set that contains 
a proper nonempty set E. Consider the c-algebra M = {@, E, X~E, X} and define 
H(B) = 0, (FE) = p(X~E) = 1/2 and w(X) = 1. For each natural number n, define 
fn =N-XE—N- Xx ~ ZF]. The sequence { f,} is uniformly integrable and tight and converges 
pointwise on X to the function f that takes the constant value oo on E and —oo on X~ E. 
The limit function is not integrable over X with respect to p. 

We leave the proof of the following corollary as an exercise. 


Corollary 18 Let (X, M, w%) be a measure space and {h,} a sequence of nonnegative 
integrable functions on X. Suppose that {h,(x)} — 0 for almost all x in X. Then 


lim. | h, dp = Oif and only if {hy} is uniformly integrable and tight. 
n—-> xX 


PROBLEMS 


In the following problems, (X, M, yw) is a reference measure space, measurable means with 
respect to M, and integrable means with respect to p. 
27. For a set X, let M be the o-algebra of all subsets of X. 


(i) Let 7 be the counting measure of M. Characterize the real-valued functions f on X, 
which are integrable over X with respect to 7 and the value of {, f dy for such functions. 


28. 
29. 


30. 


31. 


32. 


33. 


34. 


35. 
36. 


37. 
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(ii) Let xp be amember of X and 6,, the Dirac delta measure concentrated at xp. Characterize 
the real-valued functions f on X, which are integrable over X with respect to 6,, and the 
value of J, f d5,, for such functions. 


Show that if f is integrable over X, then f is integrable over every measurable subset of X. 


Let f be a measurable function on X and A and B measurable subsets of X for which 
X = AUBand AN B= &. Show that f is integrable over X if and only if it is integrable over 
both A and B. 


Let X be the disjoint union of the measurable sets {X,,}°° ,. For a measurable function f on 
X, characterize the integrability of f on X in terms of the integrability and the integral of f 
over the X,’s. 


Let (X, M, w) be a measure space for which ».(X) = 0 and the function f on X take the 
constant value oo. Show that f, f du =0. 


Let f be integrable over X with respect to w. Show that f,, f du = 0 for every measurable 
subset E of X if and only if f = 0 a.e. on X. 


Let (X,.M, «) be a measure space and f a bounded measurable function on X that vanishes 
outside a set of finite measure. Show that 


[ fay =sup | dy = int [ ody. 
vX xX vX 


where w ranges over all simple functions on X for which w < f on X and ¢ ranges over all 
simple functions on X for which f < g on X. 


Let (X, M, w) be a measure space and f a bounded function on X that vanishes outside a 
set of finite measure. Assume 


sup | du =int f gdp, 
X X 


where ranges over all simple functions on X for which » < f on X and ¢ ranges over all 
simple functions on X for which f < g on X. Prove that f is measurable with respect to the 
completion of (X, M, p). 


Prove the linearity property of integration for general coefficients a and B. 


Let {f,} be a sequence of integrable functions on X that is uniformly integrable and tight. 
Suppose that {f,} — f pointwise a.e. on X and f is measurable and finite a.e. on X. Prove 
that f is integrable over X. 


Let {fn} be a sequence of integrable functions on X that is uniformly integrable. Suppose 
that {f,} -~ f pointwise a.e. on X and f is measurable. Assume the measure space has the 
property that for each e > 0, X is the union of a finite collection of measurable sets, each of 
measure at most e. Prove that f 1s integrable over X. 


. Prove Corollary 18. 
39. 


Deduce the Lebesgue Dominated Convergence Theorem from the Vitali Convergence 
Theorem. 


. Show that almost everywhere convergence can be replaced by convergence in measure in 


the Lebesgue Dominated Convergence Theorem and the Vitali Convergence Theorem (see 
Problem 15 for the definition of convergence in measure). 
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41. 


42. 


43. 


45. 


Let { f,} be a sequence of functions on X, each of which is integrable over X. Show that { f,} 
is uniformly integrable if and only if for each € > 0, there is a 6 > 0 such that for any natural 
number n and measurable subset E of X, 


if u(E) <6, then 


| fn aw <e, 
E 


Let 4 be another measure on M. For an extended real-valued function f on X that is 
measurable with respect to the measurable space (X, M), under what conditions is it true 


that — 
[ tan+nl=f tans f fan 


Let Mo be a o-algebra that is contained in M, yo the restriction of w to Mo, and f a 
nonnegative function that is measurable with respect to Mo. Show that f is measurable with 


respect to M and 
[ sano s [ pau. 
X X 


Can this inequality be strict? 


. Let v be a signed measure on (X, M). We define integration over X with respect to a signed 


measure v by defining 


[tam sat [ par, 


provided f is integrable over X with respect to both yt and vy. Show that if | f| < Mon X, then 


[te 


Moreover, if |y|(X )<oo, show that there is a measurable function f with | f| < lon X for which 


< M|p|(X). 


| f dv = |v|(X). 
X 
Let g be a nonnegative function that is integrable over X. Define 


E) = | g du for all Ee M. 
E 


(i) Show that v is a measure on the measurable space (X, M). 


(ii) Let f be a nonnegative function on X that is measurable with respect to M. Show that 


[ tav=[ ted 


(Hint: First establish this for the case when f is simple and then use the Simple 
Approximation Lemma and the Monotone Convergence Theorem.) 
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46. Letv: M — [0, co) bea finitely additive set function. Show that if f is a bounded measurable 
function on X, then the integral of f over X with respect to p, f y f dv, can be defined so that 
I, xeEdv = v(E), if E is measurable and integration is linear, monotone, and additive over 
domains for bounded measurable functions. 


47, Let » be a finite premeasure on an algebra S and p its Carathéodory extension. Let E be 
p*-measurable. Show that for each € > 0, given there is an A € S with 


u({[A~ E]U[E~ A]) <e. 


48. Let S be an algebra of subsets of a set X. We say that a function g: X > R is S-simple 
provided » = S)_, axXa,, where each A, € S. Let yw be a premeasure on S and yp its 
Carathéodory extension. Given € > 0 and a function f that is integrable over X with respect 
to #1, show that there is an S-simple function ¢ such that 


| If —oldp <e. 
xX 


18.4 THE RADON-NIKODYM THEOREM 


Let (X, M) be a measurable space. For a measure on (X, M) and f a nonnegative 
function on X that is measurable with respect to M, define the set function v on M by 


(E)= | fdwtorall Be M. (28) 
E 


We infer from the linearity of integration and the Monotone Convergence Theorem that v 
is a measure on the measurable space (X, M), and it has the property that 


if Ee¢ Mand p(E) =0, thenv(E) = 0. (29) 


The theorem named in the title of this section asserts that if x is o-finite, then every o-finite 
measure v on (X, M) that possesses property (29) is given by (28) for some nonnegative 
function f on X that is measurable with respect to M. A measure v is said to be absolutely 
continuous with respect to the measure yw provided (29) holds. We use the symbolism v << ps 
for v absolutely continuous with respect to w. The following proposition recasts absolute 
continuity in the form of a familiar continuity criterion. 


Proposition 19 Let (X, M, «) be a measure space and v a finite measure on the measurable 
space (X, M). Then v is absolutely continuous with respect to s if and only if for each € > 0, 
there is a 6 > 0 such that for any set E € M, 


if u(E) <6, thenv(E) <e. (30) 


Proof It is clear that the ¢-6 criterion (30) implies that v is absolutely continuous with 
respect to yz, independently of the finiteness of v. To prove the converse, we argue by 
contradiction. Suppose v is absolutely continuous with respect to yz but the «—6 criterion 
(30) fails. Then there is an «9 > 0 and a sequence of sets in M, {E,}, such that for each n, 
p( En) <1/2" while v( E,) >= €o. For eachn, define A, = U7, Ex. Then {An} is a descending 
sequence of sets in M. By the monotonicity of v and the countable subadditivity of yw, 


v(A,) > €y and p(A,) < 1/2"! for all n. 
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Define Ago = 2, An. By the monotonicity of the measure 1, w( Ao.) = 0. We infer from 
the continuity of the measure v that, since v(A,) < v(X) <0o and v(A,) > € for all n, 
v(Aoo) = €o. This contradicts the absolute continuity of v with respect to p. 


The Radon-Nikodym Theorem Let (X, M, 1) be ao-finite measure space and v a c-finite 
measure defined on the measurable space (X, M) that is absolutely continuous with respect to 
p. Then there is a nonnegative function f on X that is measurable with respect to M for which 


M(B) = ff dp forall Be M (31) 


The function f is unique in the sense that if g is any nonnegative measurable function on X 
that also has this property, then g = f ae. [u]. 


Proof We assume that both w and v are finite measures and leave the extension to the 
o-finite case as an exercise. If v(E) = 0, for all E € M, then (31) holds for f =0 on X. So 
assume v does not vanish on all of M. We first prove that there is a nonnegative measurable 
function f on X for which 


[ fau>0and [ fay <v(e) forall E eM. (32) 
xX E 


For A > 0, consider the finite signed measure v — Aw. According to the Hahn Decomposition 
Theorem, there is a Hahn decomposition {P,, N,} for v — Ap, that is, X = P, UN, and Py, N 
N) = @, where P) is a positive set and N) is a negative set for v — Aw. We claim that there is 
some A > 0 for which u( P,) > 0. Assume otherwise. Let A > 0. Then p( P, ) = 0. Therefore 
#(E) = 0 and hence, by absolute continuity, v(E) = 0, for all measurable subsets of P). 
Since N) is a negative set for vy — Ap, 


v(E) < Ap(E) for all E € M and all A > 0. (33) 


We infer from these inequalities that v(E) = 0 if u(E) > 0 and of course, by absolute 
continuity, v(E) = 0 if u(E) = 0. Since u(X) < 00, »(E) = 0 for all E € M. This is a 
contradiction. Therefore we may select Ag > 0 for which yx( Py, ) > 0. Define f to be Ag times 
the characteristic function of P),. Observe that f y J dp > 0 and, since v — App is positive on 


Pro 
| f dp. = App( Py, NE) < v( Py, NE) < v(E) for all E € M. 
E 


Therefore (32) holds for this choice of f. Define F to be the collection of nonnegative 
measurable functions on X for which 


| f dp < v(E) for all E e M, 
E 
and then define 
M = Sup fer | f dp. (34) 
X 


We show that there is an f € F for which [{ y f du = M and (31) holds for any such f. If g 
and h belong to F, then so does max{g, h}. Indeed, for any measurable set E, decompose 
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E into the disjoint union of FE; = {x € E| g(x) < h(x)} and E7 = {x € E| g(x) > h(x)} and 
observe that 


[maxis nidu= | hdu+ | gdp <v(E,)+v(E2) =v(£). 
E Ej Ey 


Select a sequence {f,} in F for which lim,_,.. ty frdp = M. We assume {f,} is point- 
wise increasing on X, for otherwise, replace each f, by max{f,..., f,}. Define f(x) = 
limy+oo fn(x) for each x € X. We infer from the Monotone Convergence Theorem that 
, f du = M and also that f belongs to F. Define 


n(£)=0(£)- | fdp forall Be M. (35) 


By assumption, v(X) < oo. Therefore [ x f dp < v(X) < oo, and hence, by the countable 
additivity of integration, 7 is a signed measure. It is a measure since f belongs to Ff, and it 
is absolutely continuous with respect to w. We claim that 7 = 0 on M and hence (31) holds 
for this choice of f. Indeed, otherwise, we argue as we just did, with v now replaced by 7, to 
conclude that there is a nonnegative measurable function f for which 


[ Fau>oand | Fay <n(e)=0(8)- f fdetorall E eM. (36) 
X E E 


Therefore f + f belongs to F and Sylf + f]du> tx f dy = M, a contradiction of the 
choice of f. It remains to establish uniqueness. But if there were two, necessarily integrable, 
functions f, and f) for which (31) holds, then, by the linearity of integration, 


[ir — f2|dp =0 for all E e M. 
E 
Therefore f,; = fy a.e. [yu] on X. q 


In Problem 59 we outline another proof of the Radon-Nikodym Theorem due to John 
von Neumann: it relies on the Riesz-Fréchet Representation Theorem for the dual of a 
Hilbert space. 


Example The assumption of o-finiteness is necessary in the Radon-Nikodym Theorem. 
Indeed, consider the measurable space (X, M), where X = [0, 1] and /M is the collection 
of Lebesgue measurable subsets of [0, 1]. Define 2 to be the counting measure on M, so 
#( E) is the number of points in F if E is finite, and otherwise (FE) = oo. The only set of 
#4 measure zero is the empty-set. Thus every measure on ™ is absolutely continuous with 
respect to 4. Define m to be Lebesgue measure on M. We leave it as an exercise to show 
that there is no nonnegative Lebesgue measurable function f on X for which 


m(E) = | faptoral Be M 


Recall that for a measurable space (X, M) and signed measure v on M, there is the Jordan 
decomposition v = v; — v2, where vy, and v2 are measures on M, one of which is finite: 
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We define the measure |p| to be 1; + v7. If yz is a measure on M, the signed measure vp is 
said to be absolutely continuous with respect to yz provided |»| is absolutely continuous with 
respect to 4, which is equivalent to the absolute continuity of both v; and v2 with respect to 
yu. From this decomposition of signed measures and the Radon-Nikodym Theorem, we have 
the following version of this same theorem for finite signed measures. 


Corollary 20 Let (X, M, w) be a o-finite measure space and v a finite signed measure on 
the measurable space (X, M) that is absolutely continuous with respect to . Then there is a 
function f that is integrable over X with respect to p and 


v(E) = | fdpforall Ee M 


Recall that given two measures yz and v on a measurable space (X, M), we say that p 
and v are mutually singular (and write 4 | v) provided there are disjoint sets A and B in 
M for which X = AU Band »(A) = p(B) =0. 


The Lebesgue Decomposition Theorem Let (X, M, .) be ao-finite measure space and v a 
g-finite measure on the measurable space (X, M). Then there is a measure vy on M, singular 
with respect to pw, and a measure v; on M, absolutely continuous with respect to 1, for which 
v=o +4. The measures vy and vy are unique. 


Proof Define A = «+ v. We leave it as an exercise to show that if ¢ is nonnegative and 
measurable with respect to M, then 


[ sar= [caus f gdvtorat eM 
E E E 


Since w and v are o-finite measures, so is the measure A. Moreover, yw is absolutely continuous 
with respect to A. The Radon-Nikodym Theorem tells us that there is a nonnegative 
measurable function f for which 


me)= | fdr=[ fan+f favtora ee M (37) 


Define X4 = {x € X| f(x) > 0} and Xp = {x € X| f(x) = 0}. Since f is a measurable 
function, X = Xo U Xx is a disjoint decomposition of X into measurable sets and thus 
v = vo + yy Is the expression of v as the sum of mutually singular measures, where 


vo(E) = v(EN Xo) andy, (E) =»(EN X,) forall E eM. 


Now p( Xo) = ‘fA f dA =0, since f = O0on Xo, and vp( X+) = v(X4NXo) = v(W) = 0. Thus 
# and vp are mutually singular. It remains only to show that 1; is absolutely continuous with 
respect to w. Indeed, let u( E) = 0. We must show 1;(E) = 0. However, since (EF) = 0, 
f z J dp = 0. Therefore, by (37) and the additivity of integration over domains, 


[ save | fav+ | fdv=0. 
E ENX ENX4 
But f =00n EN Xo and f > 0 on E/N Xo and thus p(EN X;) =0, thatis,y;(E)=0. O 


4 


Section 18.4 The Radon-Nikodym Theorem 385 


A few words are in order regarding the relationship between the concept of absolute 
continuity of one measure with respect to another and their integral representation and 
the representation of an absolutely continuous function as the indefinite integral of its 
derivative, which we established in Chapter 6. Let [a, b] be a closed, bounded interval and 
the real-valued function h on [a, b] be absolutely continuous. According to Theorem 10 of 
Chapter 6, 


h(d) —h(c)= f h’ dy for all [c, d] C [a, b]. (38) 


We claim that this is sufficient to establish the Radon-Nikodym Theorem in the case 
X = [a, b], M is the o-algebra of Borel subsets on [a, b] and pu is Lebesgue measure on 
M. Indeed, let v be a finite measure on the measurable space ([a, b], M) that is absolutely 
continuous with respect to Lebesgue measure. Define the function h on [a, b] by 


h(x) =v([a, x]) for all x € [a, 5}. (39) 


The function h is called the cumulative distribution function associated with v. The function h 
inherits absolute continuity from the absolute continuity of the measure v. Therefore, by (38), 


v(E)= | h’ dp for all E = [c, d] C [a, }]. 
E 


However, we infer from Corollary 14 of the preceding chapter that two o-finite measures that 
agree on closed, bounded subintervals of [a, b] agree on the smallest o-algebra containing 
these intervals, namely, the Borel sets contained in [a, b]. Therefore 


v(E)= | h’ dp for all E € M. 
E 


The Radon-Nikodym Theorem is a far-reaching generalization of the representation of 
absolutely continuous functions as indefinite integrals of their derivatives. The function f 
for which (31) holds is called the Radon-Nikodym derivative of v with respect to y. It is 
often denoted by a 


PROBLEMS 


49. Show that the Radon-Nikodym Theorem for finite measures and v implies the theorem for 
o-finite measures p and pv. 


50. Establish the uniqueness of the function f in the Radon-Nikodym Theorem. 


51. Let [a, b] be a closed, bounded interval and the function f be of bounded variation on [a, }]. 
Show that there is an absolutely continuous function g on [a, b], and a function h on fa, b] 
that is of bounded variation and has h’ = 0 a.e. on [a, b], for which f = g+h on [a, b]. Then 
show that this decomposition is unique except for addition of constants. 


52. Let (X, M, u) bea finite measure space, {E;}?_, a collection of measurable sets, and {cx}; _, 
a collection of real numbers. For E € M, define 


n 
v(E) = > ce: w( EN Ex). 
k=1 
Show that v is absolutely continuous with respect to yz and find its Radon-Nikodym derivative 
dp. 


dp* 
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53. 


54. 


55. 


57. 


58. 


Let (X, M, w) be a measure space and f a nonnegative function that is integrable over 
X with respect to w. Find the Lebesgue decomposition with respect to u of the measure v 
defined by v(E) = f,, f du for Ee M. 

Let u., v, and A be o-finite measures on the measurable space (X, M). 

(i) If» <y and f is a nonnegative function on X that is measurable with respect to M, 


show that F 
| fav= | fo | 
xX x [ap 


(i) Ify<yp anda < yp, show that 


Ath) ww a yy 
iu dp ay 2° lH 


(in) If v <p <A, show that 


d dv d 
dh dp dx 

(iv) If v <p and uw << vp, show that 
dv dp 
—-—=]lae. [yl]. 
du dv a.e. [1] 


Let pw, v, vj, and v2 be measures on the measurable space (X, M). 
(i) Show thatify | wandvy< uy, theny —0. 


(ii) Show that if»; and v> are singular with respect to p, then, for any a > 0, B > 0, so is the 
measure av; + Br2. 


(ii) Show that if »; and v2 are absolutely continuous with respect to yw, then, for any 
a> 0, B = 0, so is the measure av, + Br. 


(iv) Prove the uniqueness assertion in the Lebesgue decomposition. 


. Characterize the measure spaces (X, M, uw) for which the counting measure on M is 


absolutely continuous with respect to yz and those for which, given xo € X, the Dirac measure 
6,, on M is absolutely continuous with respect to 


Let {un} be a sequence of measures on a measurable space (X, M) for which there is a 
constant c > 0 such that un(X) <c for all n. Define w: M — [0, oo] by 


Show that p is a measure on M and that each pz, is absolutely continuous with respect to p. 


Let . and v be measures on the measurable space (X, M) and define A = + v. Let the 
nonnegative function f on X be measurable with respect to (X, M). Show that f is integrable 
over X with respect to A if and only if it is integrable over X with respect to both p and v. 
Also show that if f is integrable over X with respect to A, then 


[ car= [edu f gavforat em 
E E E 
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59. (von Neumann’s proof of the Radon-Nikodym Theorem) The basis of this proof is the 
following assertion, which is a corollary of the Riesz-Fréchet Representation Theorem for 
the dual of a Hilbert space: For a measure space (X, M, A), let L?(X, A) be the collection 
of measurable functions f on X such that f* is integrable over X with respect to A. Suppose 
that the functional : L*(X, 4) — R is linear, and bounded in the sense that there is some 
c > 0 such that 


lw( f)I ce: f Pan for all f € L?(X, A). 


Then there is a function g € L?(X , A) such that 


W(f)= f f -gdA for all f € L?(X, A). 


Assuming this representation result, verify the following steps in another proof of the Radon- 
Nikodym Theorem, where py and v are finite measures on a measurable space (X, M) and v 
is absolutely continuous with respect to p. 


(i) Define the measure A = » + v on the measurable space (X, M) and the functional p on 
L*(X, A) by 


Wf) = i f dw for all f € L?(X, A). 


Show that w is a bounded linear functional on L*(X, A). 
(ii) By the above representation result, choose a function g € L*(X, A) such that 


| fau= | f-gdafor all f € L?(X, A). 

X X 

Conclude that 
| fau= | f-gdut / f-gdv forall f € L7(X, Xr), 
X X JX 

and therefore 


we) = [ edu + | gdvforall BEM 
E E 


From this last identity conclude that g > 0 ae. [A] on X and then use the absolute 
continuity of v with respect to w to conclude that A{x € X| g(x) = 0} =0. 


(iii) Use part (ii) to assume, without loss of generality, that g > 0 on X. Fix a natural number 
n and E € M and define f = yr/[g +1/n] on X. Show that f belongs to L7(X, A). 
Conclude that 


1 1 1 
——— qd -/|—_. d +[——. dv for all n. 
ear Be de gtiin Oo Je etiin ® 


Justify taking limits as n — 00 on each side of this equality and conclude that 


v(E) = | I/e- 1] dp for all E ¢ M. 
E 


60. Let X = [0,1], M the collection of Lebesgue measurable subsets of [0, 1], and take v to be 
Lebesgue measure and p the counting measure of M. Show that v is finite and absolutely 
continuous with respect to w, but there is no function f for which »(E) = f, dw for all 
EeM. 
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18.5 THE NIKODYM METRIC SPACE: THE VITALI-HAHN-SAKS THEOREM 


Let (X, M, uw) be a finite measure space. Recall that the symmetric difference of two 
measurable sets A and B is the measurable set A A B defined by 


AAB=[A~B]U[B~ A]. 
We leave the proof of the following set identity as an exercise: 
(AAB)A(BAC)=AAC. (40) 


We introduce a relation ~ on M by defining A ~ B provided #(AAB) = 0. The above 
identity implies that this relation is transitive, and it clearly is reflexive and symmetric. 
Therefore the equivalence relation ~ induces a decomposition of M into equivalence classes; 
denote this collection by M/~. For A € M, denote the equivalence class of A by [A]. On 
M/~, define the Nikodym metric p, by 


pu([A], [B]) = w(AA B) for all A, Be M. 


We infer from the identity (40) that p,, is properly defined and the triangle inequality holds; 
the remaining two properties of a metric are evident. We call (M/~, Py) the Nikodym 
metric space associated with the measure space (X, M, .). Now let v be a finite measure 
on M that is absolutely continuous with respect to u. For A,B €¢ M with A ~ B, since 
p( AAB) = 0, v( AAB) = 0, and hence 


v(A) — »(B) = [»(AN B) +»(A~B)] —[v(AN B) + v( B~A)] = v(A~B) — o( B~A) = 0. 
We may therefore properly define v on M/~ by setting 
v([A]) = »(A) for all A € M. 


As we did with the L? spaces, for convenience and simplicity, we denote members [A] of 
M/= by A, and functions v: M/~ — [0, 00) by v: M > [0, 00). A consequence of the 
Baire Category Theorem (Theorem 7 of Chapter 10) tells us that if a sequence of real-valued 
functions on a complete metric space converges pointwise to a real-valued function, then 
there is a point in the space at which the sequence is equicontinuous. To employ this result 
in the study of sequences of absolutely continuous measures, we now show that M/~ is 
complete and that a measure on M that is absolutely continuous with respect to w induces a 
uniformly continuous function on the Nikodym metric space (M/~, Pu). 

In Chapter 7, we normed the linear space of Lebesgue measurable functions on a 
Lebesgue measurable set of real numbers, denoted it by L!, and established the Riesz- 
Fischer Theorem which told us that L! is complete and every convergent sequence in L! 
had a subsequence that converges pointwise a.e. In the first section of the next chapter, for 
a general measure space (X, M, «), we define L'(X, ) in the obvious manner and prove 
the Riesz-Fischer Theorem in general. 


Theorem 21 Let (X, M, «) be a finite measure space. Then the Nikodym metric space 
(M, p,) is complete, that is, every Cauchy sequence converges. 
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Proof Observe that for A, B € M, 
p(AAB) =| Iva — XB| dp. (41) 
X 


Define the operator T: M — L!(X, ) by T(E) = xz. Then (41) is the assertion that the 
operator T is an isometry, that is, 


p,(A, B) =||T(A) — T(B)Ihi for all A, Be M. (42) 


Let {A,} be a Cauchy sequence in (M, p,). Then {7(A,)} is a Cauchy sequence in 
L'(X, w). The Riesz-Fischer Theorem tells us that there is a function f € L'(X, ) such 
that {T(A, )} > f in L'(X, w) and a subsequence of {T(A, )} that converges pointwise to 
f almost everywhere on X. Since each T(A, ) takes the values 0 and 1, if we define Ao 
to be the points in X at which the pointwise convergent subsequence converges to 1, then 
f = XA almost everywhere on X. Therefore, by (41), {An} > Ao in (M, p,.). The proof is 
complete. LJ 


Lemma 22 Let (X, M, «) be a finite measure space and v a finite measure on M. Let Eg be 
a measurable set and « > 0 and 6 > 0 be such that for any measurable set E, 


if py(E, Eo) <6, then |p(E) —v(Eo)| < «€/4. (43) 
Then for any measurable sets A and B, 
if pu( A, B) <6, then |v(A) —v(B)| <e. (44) 
Proof We first verify that 


if pp(A, 6) <6, then v( A) < /2. (45) 


Observe that if D C C, then C A D = C~D. Let A belong to M and p( A, 8) = w(A) <6. 
Observe that 
[Eo~A] A Ep = Ep~[Eo~A] = EQN ACA. 


Hence p,(Eo~A, Eo) = w([Eo~A] A Eo) < w(A) <4, and therefore, by assumption (43), 
v( Eo) — v( Eo~A) <é/4. 
We infer from the excision property of v that 
v(AN Eo) = (Eo) — v( Eo~wA) < €/4. 
Now observe that 
Eo A[Ep U[A~Ep]] = [Eo U[A~Eo]]~£0 = A~E0 CA. 
Thus, arguing as above, 


v(A~Ep) = v( Ep U [A~Eo]) — v( Eo) <€/4. 
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Therefore 
v(A) =v(AN Eo) + »( A~Ep) <€/2, 


and so (45) is verified. 


But for any two measurable sets, since v is real-valued and finitely additive, 
v(A) —»(B) =[v(A~B) +»(ANB)] — [p(B~ A) + (AN B)] = v(A~B) —v(B~A). 
Therefore (45) implies (44). O 


Proposition 19 tells us that a finite measure v on M is absolutely continuous with 
respect to y if and only if each « > 0, there is a 6 > 0 such that if u(E) <6, then »(E) <e. 
This means that if v is finite, then v is absolutely continuous with respect to p if and only if 
the set function v is continuous with respect to the Nikodym metric at . However, we infer 
from the preceding lemma that if a finite measure v on M is continuous, with respect to the 
Nikodym metric at one set Ep in M, then it is uniformly continuous on M. We therefore 
have established the following proposition. 


Proposition 23 Let (X, M, uw) be a finite measure space and v a finite measure on M that 
is absolutely continuous with respect to . Then v induces a properly defined, uniformly 
continuous function on the Nikodym metric space associated with (X, M, p). 


Definition Let (X, M) be a measurable space. A sequence {v,} of measures on M is said to 
converge setwise on MV to the set function v provided 


»(E) = lim »,(E) for all E € M. 


Definition Let (X, M, ») be a finite measure space. A sequence {v,} of finite measures on 
M, each of which is absolutely continuous with respect to 1, is said to be uniformly absolutely 
continuous’ with respect to w provided for each € > 0, there is a &>0 such that for any 
measurable set E and any natural number n, 


if w(E) <6, then v,(E) <e. 


It is not difficult to see, using Lemma 22, that a sequence of finite measures {vn} of M 
is uniformly absolutely continuous with respect to pu if and only if the sequence of functions 
{vn: M — R} is equicontinuous‘ with respect to the Nikodym metric p,. Moreover, for 
each natural number n, the Radon-Nikodym Theorem tells us that there is a nonnegative 
integrable function f,, the Radon-Nikodym derivative of j with respect to v,, for which 


v,(E) = | fedu forall & M 


3What we here call “uniformly absolutely continuous’ might also be called equi absolutely continuous. There 
is No standard terminology. 

4Recall that a sequence of functions {h,: 5 > R} on the metric space (S, p) is said to be equicontinuous at a 
point u ¢ S provided that for each € > 0, there is a 5 > 0 such that for v € S and natural number n, if plu, v) <6, 
then |tn(u) — hn(v)| <«. The sequence {hn : S > R} is said to be equicontinuous provided it is equicontinuous at 
each point in S. 
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It is clear that the sequence of functions { f,,} is uniformly integrable over X with respect to 
if and only if the sequence of measures {v,,} is uniformly absolutely continuous with respect 
to w. We therefore have the following proposition. 


Proposition 24 Let (X, M, w) be a finite measure space and {v,} a sequence of finite 
measures of M each of which is absolutely continuous with respect to . Then the following 
are equivalent: 


(t) The sequence of measures {v,} is uniformly absolutely continuous with respect to the 
measure . 


(ti) The sequence of functions {v,: M — R} is equicontinuous with respect to the Nikodym 
metric py. 


(iii) The sequence of Radon-Nikodym derivatives ($4) is uniformly integrable over X with 
respect to the measure wp. 


Theorem 25 Let (X, M, w) bea finite measure space and {v,,} a sequence of finite measures 
on M that is uniformly absolutely continuous with respect to p. If {v,} converges setwise on 
M to v, then v is a measure of M that is absolutely continuous with respect to p. 


Proof Clearly, v is a nonnegative set function. The setwise limit of finitely additive set 
functions is finitely additive. Therefore v is finitely additive. We must verify that it is countably 
additive. Let {E;,}>° , be a disjoint collection of measurable sets. We must show that 


(0 2 = S v( Ex). (46) 
k=1 k=1 


If there is a k such that v( E;) = oo, then, by the monotonicity of v, (46) holds since both 
sides are infinite. We therefore assume that v( E;) < oo for all k. By the finite additivity of 
v, for each natural number n, 


(6 n)- $e) +4 U) & (47) 
k=1 k=1 k=n+1 


Let « > 0. By the uniform absolute continuity with respect to yz of the sequence {v,}, there 
is ad > 0 such that for E measurable and any natural number n, 


if u(E) <6, then v,( EF) <«/2, (48) 


and therefore 
if u(E) <6, then v(E) <e. 


Since 4( X) < oo and p is countably additive, there is a natural number N for which 


00 
Be (J Ex | <6. 
k=N+1 


By the choice of 5, (47), and the finiteness of each v( E;,) we conclude that 


ie & | — s v( Ex) <e. 
k=1 k=1 
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Thus (46) is verified. Therefore v is a measure and we infer from (48) that if u(£) = 0, then 
v( E) = 0 and thus p is absolutely continuous with respect to p. UO 


The following remarkable theorem tells us that, in the statement of the preceding 
theorem, if the sequence {v,(X)} is bounded, we can dispense with the assumption that 
the sequence is uniformly absolutely continuous: The uniform absolute continuity is a 
consequence of setwise convergence. The proof of this theorem rests beside the Uniform 
Boundedness Principle and the Open Mapping Theorem as one of the exceptional fruits of 
the Baire Category Theorem. 


The Vitali-Hahn-Saks Theorem Let (X, M, ) be a finite measure space and {v,} a 
sequence of finite measures on M, each of which is absolutely continuous with respect to p. 
Suppose that {v,(X)} is bounded and {v,} converges setwise on M to v. Then the sequence 
{v,} is uniformly absolutely continuous with respect to w. Moreover, v is a finite measure on 
M that is absolutely continuous with respect to . 


Proof According to Theorem 21, the Nikodym metric space is complete, and {v,,} induces 
a sequence of continuous functions on this metric space that converges pointwise (that is, 
setwise) to the function v, which is real-valued since {v,(X)} is bounded. We infer from 
Theorem 7 of Chapter 10, a consequence of the Baire Category Theorem, that there is a set 
Eo € M for which the sequence of functions {v,,: M — R} is equicontinuous at Eo, that is, 
for each € > 0, there is a 6 > 0 such that for each measurable set E and natural number 7, 


if py (E, Eo) <6, then lun ( E) — Hn( Eo )| <€. 


Since this holds for every « > 0 and each », is finite, we infer from Lemma 22 that for each 
€ > 0, there is a 6 > 0 such that for each measurable set E and natural number n, 


if py(E) <6, then p,(E) <e. 


Hence {v,,} is uniformly absolutely continuous. According to the preceding theorem, v is a 
finite measure that is absolutely continuous with respect to p. L 


Remark Of course, sigma algebras are not linear spaces and measures are not linear 
operators. Nevertheless, there is a striking similarity between the Vitali-Hahn-Saks Theorem 
and the continuity of the pointwise limit of a sequence of continuous linear operators, and 
the Baire Category Theorem is the basis of the proofs of both these results. Also observe the 
similarity between Lemma 22 and the uniform continuity of a linear operator if it is continuous 
at a point. 


Theorem 26 (Nikodym) Let (X, M) be a measurable space and {v,} a sequence of finite 
measures on M that converges setwise on M to the set function v. Assume {v,(X )} is bounded. 
Then v is a measure on M. 


Proof For a measurable set E, define 


u(E)= 3 = -»(£). (49) 
n=1 
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We leave the verification that w is a finite measure on M as an exercise. It is clear that 
each v, 1s absolutely continuous with respect to y. The conclusion now follows from the 
Vitali-Hahn-—Saks Theorem. CJ 


61. 


62. 


63. 
64. 
65. 


66. 


67. 


68. 


69. 
70. 


PROBLEMS 
For two measurable sets A and B, show that AA B = [AU B]~[A N B] and that 


pu{A, B) =p(A) + p(B) —2-p(AN B). 


Let {A,} be a sequence of measurable sets that converges to the measurable set Ag with 
respect to the Nikodym metric. Show that Ag = U%, [MP2 Ax]. 


n=1 
Show that (49) defines a measure. 


Prove Proposition 24. 


Let (X, M, w) be a finite measure space and v: M -> [0, 00) a finitely additive set function 
with the property that for each € > 0, there is a 6 > 0 such that for a measurable set E, if 
p(E) < 6, then v( E) < «. Show that v is a measure on M. 


Let (X, M) be a measurable space and {v,} a sequence of finite measures on M that 
converges setwise on M to v. Let {E;} be a descending sequence of measurable sets with 
empty intersection. Show that for each ¢ > 0, there is a natural number K for which 
v,( E,) <€ for all n. 


Give an example of a decreasing sequence {1,} of measures on a measurable space such that 
the set function yz defined by u( E) = lim p,( E) is not a measure. 


Let (X, M) be a measurable space and {,} a sequence of measures on M such that for each 
E € M, tnyi(E) > pn(E). For each E € M, define u(E) = lim pa( E). Show that p is a 
measure on M if w(x) < oo. 

Formulate and prove a version of the Vitali-Hahn—Saks Theorem for signed measures. 


Show that the Nikodym metric space associated with the finite measure space (X, M, pw) is 
separable if and only if L?(X, 2) is separable for all 1 < p < oo. 
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For a measure space (X, M, w) and 1 < p < ov, we define the linear spaces L?(X, p) 
just as we did in Part I for the case of Lebesgue measure on the real line. Arguments very 
similar to those used in the case of Lebesgue measure on the real line show that the Holder 
and Minkowski Inequalities hold and that L?(X, 1.) is a Banach space. We devote the first 
section to these and related topics. The remainder of this chapter is devoted to establishing 
results whose proofs lie outside the scope of ideas presented in Part I. In the second section, 
we use the Radon-Nikodym Theorem to prove the Riesz Representation Theorem for the 
dual space of L?(X, w), for 1 < p< oo and » a o-finite measure. In the third section, we 
show that, for 1< p<oo, the Banach space L?(X, w) is reflexive and therefore has the weak 
sequential compactness properties possessed by such spaces. In the following section, we 
prove the Kantorovitch Representation Theorem for the dual of L(X, w). The final section 
is devoted to consideration of weak sequential compactness in the nonreflexive Banach space 
L'(X, w). We use the Vitali- Hahn—Saks Theorem to prove the Dunford-Pettis Theorem, 
which tells us that, if 4( X ) < oo, then every bounded sequence in L!(X, ) that is uniformly 
integrable has a weakly convergent subsequence. 


19.1 THE COMPLETENESS OF L(x, 1),1 < p <0 


Let (X, M, w) be a measure space. Define F to be the collection of all measurable extended 
real-valued functions on X that are finite almost everywhere on X. Since a function that is 

" integrable over X is finite a.e. on X, if f is a measurable function on X and there is a p in 
(0, co) for which f, | f|? du < oo, then f belongs to F. Define two functions f and g in F 
to be equivalent, and write 


f = g provided f = gae.on X. 


This is an equivalence relation, that is, it is reflexive, symmetric, and transitive. Therefore 
it induces a partition of F into a disjoint collection of equivalence classes. We denote 
this collection of equivalence classes by ¥/~. There is a natural linear structure on F/~. 
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Given two equivalence classes [ f] and [g] and real numbers a and B, we define the linear 
combination a -[f]+ 8 -[g] to be the equivalence class of the functions belonging to F that 
take the value a f(x) + Bg(x) on Xo, where Xo is the set of points in X at which both f and 
g are finite. Observe that linear combinations of equivalence classes are properly defined in 
that they are independent of the choice of representatives of the equivalence classes. The 
zero element of this linear space is the equivalence class of functions that vanish almost 
everywhere on X. 
Let L?(X, yw) be the collection of equivalence classes [ f] for which 


[ise < 00. 


This is properly defined since if f = f,, then | f|? is integrable over X if and only if | f,|? is. 
We infer from the inequality 


lat+ bl? < 2?[\a|? + |b|?] for alla,be R 


and the integral comparison test that L?(X, y) is a linear space. For an equivalence class [ f] 
in L?(X, w) we define ||[ f]||p by 


1/p 
IL/lllp = | [ si ay | 


This is properly defined. It is clear that ||[ f]|| p = Oifand only if[f] = 0 and ||[a-: f] Il p=a-|I[ fll 
for each real number a. 

We call an equivalence class [ f] essentially bounded provided there is some M > 0, 
called an essential upper bound for [ f], for which 


| f]| < M ae. on X. 


This also is properly defined, that is, independent of the choice of representative of the 
equivalence class. We define L©(X, 1) to be the collection of equivalence classes [ f] for 
which f is essentially bounded. Then L™(X, 1) also is a linear subspace of F/=. For 
[f] € L©(X, pw), define ||[f]lloo to be the infimum of the essential upper bounds for f. This 
is properly defined. It is easy to see that ||[f]|loo is the smallest essential upper bound for f. 
Moreover, ||[ f]lloo = 0 if and only if [f] = 0 and ||[a- f]llo=a- |I[ f]lloo for each real number 
a. We infer from the triangle inequality for real numbers that the triangle inequality holds 
for || - ||. and hence it is a norm. 

For simplicity and convenience, we refer to the equivalence classes in ¥/~ as functions 
and denote them by f rather than [f]. Thus to write f = g means that f(x) = g(x) for 
almost all x € X. 

Recall that the conjugate g of a number p in (1, oo) is defined by the relation 
1/p +1/q = 1; we also call 1 the conjugate of oo and oo the conjugate of 1. 

The proofs of the results in this section are very similar to those of the corresponding 
results in the case of Lebesgue integration of functions of a real variable. 


Theorem1 Let (X, M, ») bea measure space, 1 < p <0, and q the conjugate of p. If f be- 
longs to L?(X, ) and g belongs to L4(X, w), then their product f .g belongs to L'(X, ) and 
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Holder’s Inequality 
fis -8ldu=Iif- gli <i filp- Walle. 


Moreover, if f #0, the function f* = Ifllp? . sen( f) -|f|?~! belongs to L4(X, w), 


[ tf du= Ifllp and || f"llq =1. (1) 


Minkowski’s Inequality For 1 < p < cand f,gé€L?(X, p), 


If + sllp < fly + Ilgllp- 
Therefore L?(X, w) is anormed linear space. 


The Cauchy-Schwarz Inequality Let f and g be measurable functions on X for which f? 
and g’ are integrable over X. Then their product f - g also is integrable over X and, moreover, 


f iteian<,|f Paws} ean. 


Proof If p = 1, then Hdélder’s Inequality follows from the monotonicity and homogeneity 
of integration, together with the observation that ||g||. is an essential upper bound for g. 
Equality (1) is clear. Assume p > 1. Young’s Inequality asserts that for nonnegative real 
numbers a and b, 
ab < t gh +b! 
Pp q 
Define a = f, | f|? du and B = fy |gl? du. Assume a and B are positive. The functions f and 
gare finite a.e. on X. If f(x) and g(x) are finite, substitute | f(x)|/a!/? for a and |g(x)|/B\/? 
for b in Young’s Inequality to conclude that 
1 1 1 


“Tp aig lf (*)8(*)I S 


1 1 
: Pp —.—. qd X. 
al/p . pila lf(x)| +o 3 |e(x)|? for almost all x € 


Integrate across this inequality, using the monotonicity and linearity of integration, and 
multiply the resulting inequality by a'/? . B'/4 to obtain Hélder’s Inequality. Verification of 
equality (1) is an exercise in the arithmetic of p’s and q’s. To verify Minkowski’s Inequality, 
since we already established that f + g belongs to L?(X, ), we may consider the associated 
function ( f+ g)* in L4(X, yw) for which (1) holds with f + g substituted for f. According to 
Holder’s Inequality, the functions f-(f+g)*+g-(f+g)* are integrable over X. Therefore, 
by the linearity of integration and another employment of Hélder’s Inequality, 


[ (740): +8y" de 


If + slip 


[ f-rtarant fo (tte) te 
»¢ »¢ 


lA 


Ifllp- WF +8)" lg + llgllp- MF +8)" lg 


lfllp + llsllp- 
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Of course, the Cauchy-Schwarz Inequality is Minkowski’s Inequality for the case p = q = 2. 
L 


Corollary 2 Let (X, M, ) be a finite measure space and 1 < p, < po < ©. Then 
LP2(X, 4) CL?!(X, w). Moreover, for 


P2= PL 1 
c=[w(X)] 1” if pr < co andc = [w(X)]” if pz = 00, (2) 
fll, <cllf\lp, for all f in LP( xX). (3) 
Proof For f ¢ L”?(X, ), apply Hélder’s Inequality, with p = p2 and g = 1 on X, to 
confirm that (3) holds for c defined by (2). L 


Corollary 3 Let (X, M, «) be ameasure space and 1< p < ov. If { f,} is a bounded sequence 
of functions in L?(X, w), then { f,} is uniformly integrable over X. 


Proof Let M > 0 be such that || f ||, < M for all n. Define y = 1if p= oo and y= (p—1)/p 
if p < oo. Apply the preceding corollary, with pj = 1, pp = p, and X = E, a measurable 
subset of X of finite measure, to conclude that for any measurable subset EF of X of finite 
measure and any natural number n, 


[ told <M - [a e)P 
E 
Therefore { f,,} is uniformly integrable over X. LJ 


For a linear space V normed by || - ||, we call a sequence {v;,} in V rapidly Cauchy 
provided there is a convergent series of positive numbers }7° , €, for which 


lues1 — vell < é for all natural numbers k. 


We observed earlier that a rapidly Cauchy sequence is Cauchy and that every Cauchy 
sequence has a rapidly Cauchy subsequence.! 


Lemma 4 Let (X, M, ) be a measure space and 1 < p < ov. Then every rapidly Cauchy 
sequence in L?(X, 2) converges to a function in L?(X, .), both with respect to the L?(X, ) 
norm and pointwise almost everywhere on X. 


Proof We leave the case p = oo as an exercise. Assume 1 < p < oo. Let yi && be a 
convergent series of positive numbers for which 


lfc+1 — Sillp S eé for all natural numbers k. (4) 
Then 
p 
OO 
| \fnok — fal?du < | > éj for all natural numbers n and k. (5) 
Xx j=n 


| See the footnote on page 146 regarding rapidly converging Cauchy sequences. 
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Fix a natural number k. According to Chebychev’s Inequality, 
1 
€ 


1 
pfx ex | lfeat(x) — fx(x)|? > cf} < P Mi — frl? du = = Who — fell, (6) 


and therefore 
we{xeX | | feri(x) — fe(x)| = ee} < €? for all natural numbers k. 


Since p > 1, the series °°, ef converges. The Borel-Cantelli Lemma tells us that there is a 
subset Xo of X for which w( X ~ Xo) = Oand for each x € Xo, there is an index K(x) such that 


lferi(x) — fe(x)| < €, for all k > K(x). 


Hence, for x € Xo, 


lfntk(x) — fn(x)| < S €; for alln > K(x) and allk. (7) 


j=n 


The series pal €; converges, and therefore the sequence of real numbers { f; (x)} is Cauchy. 


The real numbers are complete. Denote the limit of {f;(x)} by f(x). Define f(x) = 0 for 
x € X~Xo. Taking the limit as k > oo in (5) we infer from Fatou’s Lemma that 


P 
[o @) 
[it~ fray = Yj for all n. 
x jan 


Since the series }~., ef converges, f belongs to L?(X) and {f,} > f in L?(X). We 
constructed f as the pointwise limit almost everywhere on X of { fn}. L 


The Riesz-Fischer Theorem Let (X, M, 1) be a measure space and 1 < p < oo. Then 
L?(X, w) is a Banach space. Moreover, if a sequence in L?( X, w) converges in L?(X, 1) to 
a function f in L?, then a subsequence converges pointwise a.e. on X to f. 


Proof Let {f,,} be a Cauchy sequence in L?(X, 4). To show that this sequence converges to 
a function in L?(X, 4), it suffices to show it has a subsequence that converges to a function 
in LP(X, w). Choose { f,,} to be a rapidly Cauchy subsequence of { f,,}. The preceding lemma 
tells us that {f,,} converges to a function in L?(X, 4.) both with respect to the L’(X, p) 
norm and pointwise almost everywhere on X. O 


Theorem 5 Let (X, M, «) be a measure space and1 < p < oo. Then the subspace of simple 
functions on X that vanish outside a set of finite measure is dense in L?( X, p). 


Proof Let f belong to L?(X, yw). According to Proposition 9 of the preceding chapter, {x € 
X | f(x) #0} is o-finite. We therefore assume that X is o-finite. The Simple Approximation 
Theorem tells us that there is a sequence {,} of simple functions on X, each of which 
vanishes outside a set of finite measure, which converges pointwise on X to f and for which 
lv, | <|f| on X for all n. Then 


ln — fl? <2? -|f|? on X for all n. 
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Since |f|? 1s integrable over X, we infer from the Lebesgue Dominated Convergence 
Theorem that the sequence {w,,} converges to f in L?(X, pw). LJ 


We leave the proof of the following consequence of the Vitali Convergence Theorem 
as an exercise (see Corollary 18 of the preceding chapter). 


The Vitali L? Convergence Criterion Let (X, M, ) be a measure space and 1 < p< oo. 
Suppose { f,,} is a sequence in L?( X, yw) that converges pointwise a.e. to f and f also belong 
to L?(X, w). Then {f,} > f in L?(X, pw) if and only if {| f|?} is uniformly integrable and 
tight. 


PROBLEMS 


1. For 1 < p< o and a natural number, define f,(x) =n!/? if0 < x <1/nand f,(x) = Oif 
1/n <x <1. Let f be identically zero on [0, 1]. Show that {f,,} converges pointwise to f but 
does not converge in L?. Where does the Vitali Convergence Criterion in L? fail? 


2. For 1 < p< oo andn a natural number, let f,, be the characteristic function of [n, n + 1]. Let 
f be identically zero on R. Show that { f,,} converges pointwise to f but does not converge in 
L?. Where does the Vitali L’ Convergence Criterion fail? 


3. Let (X, M, w) be a measure space and 1 < p < oo. Let { f,} be a sequence in L?(X, y) and 
f afunction L?(X, «) for which { f,} — f pointwise a.e. on X. Show that 


{fn} > f in L?(X, yw) if and only if the sequence {| f|} is uniformly integrable and tight. 
4. For a measure space (X, M, uw) and 0 < p <1, define L?(X, yw) to be the collection of 
measurable functions on X for which | f|? is integrable. Show that L?(X, w) is a linear space. 


For f € L?(X, w), define | filb = fy Lf? du. 
(i) Show that, in general, || - ||, is not a norm since Minkowski’s Inequality may fail. 


(ii) Define 
pf, g)= | |f — gl? du for all f, g € L?(X, p). 
x 
Show that p is a metric with respect to which L?(X, w) is complete. 


5. Let (X, M, w) be a measure space and {f,,} a Cauchy sequence in L™(X, y). Show that 
there is a measurable subset X of X for which w( X~Xo ) = 0 and for each € > 0, there is an 
index N for which 

fn — fm| < € on Xo for alln,m > N. 


Use this to show that L™(X, yw.) is complete. 


19.2 THE RIESZ REPRESENTATION THEOREM FOR THE DUAL OF L’(X, 11),1 < p < 00 


For 1 < p< o, let f belong to L4(X, w), where g is conjugate of p. Define the linear 
functional Ty: L?(X, ») > R by’ 


T;(g) = | tea for all g ¢ L?(X, yp). (8) 


Bear in mind that the “functions” are, in fact, equivalence classes of functions. This functional is properly 
defined on the equivalence classes since if f = f and g = £ ae. on X. then 


[ f-eau= fF aay 
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Holder’s Inequality tells us that Ty is a bounded linear functional on L” and its norm is at 
most || f||,, while (1) tells us that its norm is actually equal to || f||,. Therefore T: L4(X, w) > 
(L?(X, w))* is an isometry. In the case that X is a Lebesgue measurable set of real numbers 
and yt is Lebesgue measure, we proved that T maps L4(X, w) onto (L?(X, w))*, that is, 
every bounded linear functional on L?(X, w) is given by integration against a function in 
L4(X, w). This fundamental result holds for general o-finite measure spaces. 


The Riesz Representation Theorem for the Dual of L?(X,p) Let (X, M, ») be ao-finite 
measure space, 1 < p<, and q the conjugate to p. For f € L4(X, w), define Ty € 
(L?(X, w))* by (8). Then T is an isometric isomorphism of L4(X, w) onto (L?(X, pw) )*. 


Before we prove this theorem, a few words are in order contrasting the proof in the 
case of a closed, bounded interval with the general proof. In the case of Lebesgue measure 
m on X = [a, b], a closed, bounded interval of real numbers, the heart of the proof of 
the Riesz Representation Theorem lay in showing if S is a bounded linear functional on 
L?([a, b], m), then the real-valued function x+> h(x) = S(X{a, xj) 1s absolutely continuous 
on [a, b]. Once this was established, we inferred from the characterization of absolutely 
continuous functions as indefinite integrals that 


S(X{a.x)) = h(x) = | h'dm for all x € [a, D]. 
[a,x] 
From this we argued that h’ belonged to L? and 
s(g)= | h’ - gdm for all g € L”([a, bj, m). 
[a, 5] 


In the case of a general finite measure space, if S is a bounded linear functional on L’(X, 2), 
we will show that the set function E->v(E) = S(ye) is a measure that is absolutely 
continuous with respect to w. We then define f to be the Radon-Nikodym derivative of v 
with respect to py, that is, 


S(xe)= | f dp for all Ee M. 
E 


We will argue that f belongs to L? and 
S(g) -| f - gdp for all g € L?(X, p). 
xX 


Lemma 6 Let (X, M, ) be a o-finite measure space and 1 < p < oo. For f an integrable 
function over X, suppose there is an M > 0 such that for every simple function g on X that 
vanishes outside of a set of finite measure, 


f tean 


Then f belongs to L4( X, w), where q is conjugate of p. Moreover, || f\lq < M. 


<M: \lgllp. (9) 
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Proof First consider the case p > 1. Since |f| is a nonnegative measurable function and 
the measure space is o-finite, according to the Simple Approximation Theorem, there is a 
sequence of simple functions {¢,}, each of which vanishes outside of a set of finite measure, 
that converges pointwise on X to |f| and 0 < @, < |f| on E for all n. Since {pi} converges 
pointwise on X to |f|%, Fatou’s Lemma tells us that to show that |/|? is integrable and 
Il flly < M it suffices to show that 


i] pt dp < M? for alln. (10) 
X 


Fix a natural number n. To verify (10), we estimate the functional values of gas follows : 
h=on-Oh' SIfl-of' =f -sen(f) ef on X, (11) 
Define the simple function g, by 
gn = sgn( f)- ef" on X. 
We infer from (11) and (9) that 


[ otaus | F-o.du<Misoly (12) 
xX xX 


Since p and gq are conjugate, p(q — 1) = q and therefore 


—j 
J leui?du= fof" )au= | ofan 
xX X X 


Thus we may rewrite (12) as 


1/p 
[stdncm| [ sae . 
x xX 


For each n, ¢} is a simple function that vanishes outside of a set of finite measure and 
therefore it is integrable. Thus the preceding integral inequality may be rewritten as 


1-—1/p 
[ otay <M. 
X 


Since 1 — 1/p = 1/q, we have verified (10). 


It remains to consider the case p = 1. We must show that M is an essential upper 
bound for f. We argue by contradiction. If M is not an essential upper bound, then there 
is some € > 0 for which the set X. = {x € X||f(x)| > M + €} has nonzero measure. Since 
X is o-finite, we may choose a subset of X, with finite positive measure. If we let g be the 
characteristic function of such a set we contradict (9). O 


402 Chapter19 General L? Spaces: Completeness, Duality, and Weak Convergence 


Proof of the Riesz Representation Theorem We leave the case p = 1 as an exercise (see 
Problem 6). Assume p > 1. We first consider the case u(X) < oo. Let S$: L?(X, w) > R be 
a bounded linear functional. Define a set function v on the collection of measurable sets M 
by setting 

v(E) = S(ye) for Ee M. 


This is properly defined since 4(X) < oo and thus the characteristic function of each 
measurable set belongs to L?(X, «). We claim that v is a signed measure. Indeed, let 
{Ex}, be a countable disjoint collection of measurable sets and E = Ure, Ex. By the 
countable additivity of the measure p., 


e@) 
w(E) = >) w( Ex) < 00 
k=1 
Therefore x 
lim S p( Ex) =0 
n—>0O kon 
Consequently, 
0 20 ip 
dim xe — > XEullp - | > !(Ex)} =0. (13) 
k=1 k=n+1 


But S is both linear and continuous on L?(X, w) and hence 


S(xe) = 3. S(xm,), 
k=1 


that is, 
v(E)= S v( Ex). 
k=1 


To show that v is a signed measure it must be shown that the series on the right converges 
absolutely. However, if, for each k, we set cy, = sgn(S(yz,)), then arguing as above we 
conclude that the series 


10, 6) (o.@) (o@) 
> S(ck + X,) is Cauchy and thus convergent, so 5 |v(Ex)| = 5. S(cx- vz, ) converges. 
k=l k=l k=l 


Thus v is a signed measure. We claim that v is absolutely continuous with respect to p. 
Indeed, if E € M has u( E) = 0, then y¢ is a representative of the zero element of L?(X, p) 
and therefore, since S is linear, »,(E) = S(yz) = 0. According to Corollary 20 in Chapter 


18, a consequence of the Radon-Nikodym Theorem, there is a function f that is integrable 
over X and 


S(VE) = v(E) = | fautoral Ee M 


For each simple function g, by the linearity of S and of integration, since each simple function 
belongs to L?(X, p), 


S(g) = [ fo dp. 
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Since the functional S$ is bounded on L?(X, w), |S(g)| < || Sllllgllp, for each g € L?(X, pw). 
Therefore, 


/ fo au = |S(¢)| < IISIlllel]p for each simple function ¢, 
x 


and consequently, by Lemma 6, f belongs to L¢?. From Hélder’s Inequality and the continuity 
of S on L?(X, »), we infer that the functional 


gs(e)- | f - gdp for all g € L?(X, p) 
x 


is continuous. However, it vanishes on the linear space of simple functions that, according to 
Theorem 5, is a dense subspace of L?(X, 1). Therefore S — Ty vanishes on all of L?(X, w), 
that is, S = Ty. 

Now consider the case that X is o-finite. Let {X,} be an ascending sequence of 
measurable sets of finite measure whose union is X. Fix n. We have just shown that there is 
a function f, in L4(X pw) for which 


fn =90n X~X, | \futtdu < isi 
X 


and 
s(e)= | fg du= | fng duwif g € L?(X, w) and g =0o0n X~X;z. 
Xn X 


Since any function f,, with this property is uniquely determined on X, except for changes 
on sets of measure zero and since the restriction of f,,; to X;, also has this property, we 
may assume fy41 = fn On X,. For x € X =U, Xn, set f(x) = fu(x) if x belongs to Xz. 
Then f is a properly defined measurable function on X and the sequence {| f,|7} converges 
pointwise a.e. to | f|?. By Fatou’s Lemma, 


| fl4du < liming | fal@dy < ISI 
xX xX 


Thus f belongs to L7. Let g belong to L?(X, w). For eachn, define g, = g on X, and gy, = 0 
on X~X,,. Since, by Hélder’s Inequality, | fg| is integrable over X and |fgn| < | fg| ae. on 
X, by the Lebesgue Dominated Convergence Theorem, 


Jim, [ fen du = [ fed. (14) 


On the other hand, {|g, — g|?} > 0 pointwise a.e. on X and |g, — g|? < |g|? ae. on X, for all 
n. Once more invoking the Lebesgue Dominated Convergence Theorem, we conclude that 
{gn} > gin L?(X, dp). Since the functional S is continuous on L?(X, IL), 


him S(8n) = S(g). (15) 
However, for each n, 
S(8n ) = | fn8ndp= | f8n dp, 
Xn x 
so that, by (14) and (15), S(g) = fy fg du. O 
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PROBLEMS 


6. Prove the Riesz Representation Theorem for the case p = 1 by adapting the proof for the 
case p > 1. 


7. Show that for the case of Lebesgue measure on a nontrivial closed, bounded interval [a, b], 
the Riesz Representation Theorem does not extend to the case p = oo. 


8. Find a measure space (X, M, 2) for which the Riesz Representation Theorem does extend 
to the case p = oo. 


19.3. THE KANTOROVITCH REPRESENTATION THEOREM FOR THE DUAL OF L™(X, 2) 


In the preceding section, we characterized the dual of L?(X, w) for 1 < p< oo and 
(X, M, w) ao-finite measure space. We now characterize the dual of L®(X, 2). 


Definition Let (X, M) be a measurable space and the set function v: M > R be finitely 
additive. For E € M, the total variation of v over E, |v|( E), is defined by 


ol(E) = sup Spl Ex) (16) 


k=1 


where the supremum is taken over finite disjoint collections {E; hi, Of sets in M that are 
contained in E. We call v a bounded finitely additive signed measure provided |v|(X) < ov. 
The total variation of v over X, which is denoted by ||v||yar, is defined to be |v|\(X). 


Remark [fv: M — _R is a measure, then ||v||yar = v(X). If v: M > R is a signed measure, 
we already observed that the total variation ||v|| yar is given by 


I llvar = |vl(X) = v*(X) +H" (X), 


where v = v* — v7 is the Jordan Decomposition of v as the difference of measures (see 
page 345). For a real-valued signed measure v, an analysis (which we will not present here) 
of the total variation set function |v| defined by (16) shows that \v| is a measure. Observe that 
|»| — v also is a measure and v = |v| — ||v| — v]. This provides a different proof of the Jordan 
Decomposition Theorem for a finite signed measure. 


If v: M -> Ris a bounded finitely additive signed measure on M, and the simple function 


n 
~ = % Ck Xe, 1s measurable with respect to M, we define the integral of g over X with 
l 


respect to v by 


| gdv = S ch V( Ex). 
Xx 


k=l 
The integral is properly defined, linear with respect to the integrand and 


[ea 
xX 


Indeed, in our development of the integral with respect to a measure p, only the finite 
additivity of 4 was needed in order to show that integration is a properly defined, linear, 


< |lvllvar - [l@lloo. (17) 
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monotone functional on the linear space of simple functions. Let f: X + R be a measurable 
function that is bounded on X. According to the Simple Approximation Lemma, there are 
sequence {,} and {¢,,} of simple functions on X for which 


On < Ona < f < Wp41 < hy and 0 < by, — Gy, < 1/n on X for all n. 


Therefore the sequence {y,} converges uniformly to f on X. We infer from (17) that 


/ Pn+k av — | @n dv 
X X 


We define the integral of f over X with respect to v by 


< |v llvar « ll@n+k — Gn|loo for all natural numbers n and k. 


| fdv= lm | 9,dv 
xX xX 


n> &© 


This does not depend on the choice of sequence of simple functions that converges uniformly 
on X to f. Now let (X, M, w) be a measure space. We wish to define [ y fd for 
f € L™(X, w), now formally viewed as a linear space of equivalence classes of essentially 
bounded measurable functions with respect to the relation of equality a.e. [u]. This requires 
that f, fdv = fy fidv if f = f; a.e. [u] on X. If there is a set E € M for which u(E) = 0, 
but »v( E) #0, then clearly this does not hold. We therefore single out the following class of 
bounded finitely additive signed measures. 


Definition Let (X, M, uw) be a measure space. By BF A(X, M, uw) we denote the normed 
linear space of bounded finitely additive signed measures v on M that are absolutely continuous 
with respect to ys in the sense that if E € M and p(E) = 0, then v(E) = 0. The norm of 
vp € BF A(X, M, w) is the total variation norm |\v\\ var. 


It is clear that if vy belongs to BF A(X, M, yw) and ¢ and w are simple functions that are 
equal a.e. [4] on X, then f, gdv = f y dv and hence the same is true for essentially bounded 
measurable functions that are equal a.e. [u] on X. Therefore the integral of an L°(X, yw) 
function (that is, class of functions) over X with respect to v is properly defined and 


| ta» 


Theorem 7 (the Kantorovitch Representation Theorem) Let (X, M, w) be a measure 
space. Forv € BF A(X, M, mp), define T,: L™(X, w) > R by 


< |lvllvar - II flloo for all f € L(X, w) andy e BF A(X, M, pw). — (18) 


T,( f) = | fav foralt f Ee L™(X, pw). (19) 


Then T is an isometric isomorphism of the normed linear space BF A(X, M, p) onto the 
dual of L®(X, p). 


Proof We first show that T is an isometry. In view of inequality (18), it suffices to show that 
lvilvar < \IZ,l|. Indeed, let {E;}/_, be a disjoint collection of sets in M. For 1 < k <n, define 
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cx = sgn(v( E;,)) and then define g = > Ck: XE, Then ||Plloo = 1. Thus 
k=1 


Sb (Ev) =| pdv = 7,() < [Toll 
k=1 Xx 


Therefore ||v||var < ||7_|| and hence T is an isometry. It remains to show that T is onto. Let 
S belong to the dual of L°(X, v). Define v: M > R by 


v(E) = S(xz) forall Ee M. (20) 


The functon yz belongs to L®(X, v) and therefore v is properly defined. Moreover, pv is 
finitely additive since S is linear. Furthermore, we claim that v is absolutely continuous with 
respect to wu. Indeed, let E € M have p(E) = 0. Thus »(E) = S(xz) = 0. We infer from 
the linearity of S and of integration with respect to v that 


f dv = S(f) for all simple functions in L(X, 1). 
X 


The Simple Approximation Lemma tells us that the simple functions are dense in L®(X, w). 
Therefore, since both S and integration with respect to v are continuous on L™(X, w), 


Remark Let [a, b] be a closed, bounded interval of real numbers and consider the Lebesgue 
measure space ({a, b], L, m). The operator T: L+({a, b], m) > [L™®([a, b], m)]* given by 


T,(f) = i yo Pam forall g € L'([a, b], m) and f € L®([a, b], m) 


is a linear isomorphism. Moreover, L'({a, b], m) is separable and therefore so is T(L'([a, b], 
m)). On the other hand, L™({a, b],m) is not separable. According to Theorem 13 of 
Chapter 14, if the dual of a Banach space V is separable, then V also is separable. Therefore 
T(L‘([a, b],m)) is a proper subspace of [L™([a, b], m)]*. We therefore infer from the 
Kantorovitch Representation Theorem that there is a bounded finitely additive signed measure 
vy on M that is absolutely continuous with respect to m but for which there is no function 
g €L'([a, b], m) for which 


| f dv =| g-fdm forall f € L*({a, bl, m). (21) 
[a, b} [a, b| 


The set function v cannot be countably additive since if it were, according to Corollary 20, 
there would be an L'({a, b], m) function g for which (21) holds. Thus v is a bounded set 
function on the Lebesgue measurable subsets of {a, b], is absolutely continuous with respect 
to Lebesgue measure, is finitely additive but not countably additive. No such Set function has 
been explicitly exhibited. 


Here we need to return to the formal definition of L©(X, 2) as equivalence classes of functions with respect 
to the equivalence of equality almost everywhere [j] and recognize that S is defined on these equivalence classes. 
Since yf is the representative of the zero equivalence class and S is linear, S(yz¢) =0. 
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PROBLEMS 
In the following problems (X, M, yw) is a complete measure space. 


9. Show that BF. A(X, M, yw) is a linear space on which || - ||var is a norm. Then show that this 
normed linear space is a Banach space. 


10. Let v: M —> R be a signed measure and (X, M, w) be o-finite. Show that there is a function 
f € L'(X, p) for which 


[ gav= | ¢- Fay forallg L(x, n) 
X X 


11. Let {v,} be a bounded sequence in BF A([a, b], £, m). Show that there is a subsequence 
{vy,} andy € BF A([a, b], £, m) such that 


lim | f din, =| f dv forall f € L'([a, b], m). 

k->oo Jy xX 

12. Let {41,} be a sequence of measures on the Lebesgue measurable space ([a, b], £) for 
which {;([a, b])} is bounded and each v,, is absolutely continuous with respect to Lebesgue 
measure m. Show that a subsequence of {u,} converges setwise on M to a measure on 
({a, b], £) that is absolutely continuous with respect to m. 


19.4 WEAK SEQUENTIAL COMPACTNESS IN L(X, 11),1 < p <1 


Recall that for X a normed linear space, the dual space of bounded linear functionals on X 
is denoted by X* and the dual of X* is denoted by X**. The natural embedding J: X — X** 
is defined by 

J(x)[] = w(x) for all x € X, p € X”. 


We inferred from the Hahn-Banach Theorem that the natural embedding 1s an isometry and 
called X reflexive provided the natural embedding maps X onto X**. Theorem 17 of Chapter 
14 tells us that every bounded sequence in a reflexive Banach space has a weakly convergent 
subsequence. 


Theorem 8 Let (X, M, ) be ao-finite measure space and 1 < p< oo. Then L?(X, p)isa 
reflexive Banach space. 


Proof The Riesz Representation Theorem tells us that for conjugate numbers r,s € (1, 00), 
the operator T,: L” > (L*)*, defined by 


b 

[T,-(h)](g) = | g-hforallh e L’ and ge L’, 
a 

is an isometric isomorphism from L’ onto (L*)*. To verify the reflexivity of L? we let 

S:(L?)* + R be a continuous linear functional and seek a function f ¢ L? for which 

S = J(f).4 But observe that So T,: L? > R, being the composition of continuous linear 


4We repeat an earlier caveat pertaining to reflexivity. For a normed linear space X to be reflexive it is not 
sufficient that X be isomorphic to its bidual X**; it is necessary that the natural embedding be an isomorphism of X 
onto X**. See the article by R.C. James, “A non-reflexive Banach space isometric to its second dual,” Proc. Nat. 
Acad. Sci. U.S.A. 37 (1951). 
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operators, also is a continuous linear functional. By the Riesz Representation Theorem, T, 
maps L? onto (L4)*, and hence there is a function f € L? for which S o T, = T,( f), that is 


(So 7, )[g] = Tp( f)[g] for all g € L?. 


Thus 
S(Tq(8)) = Tp( fig] = Ty(8 Lf] = J( £)(Tq(g)) for all g € L4. 
Since T, maps L? onto (L?)*, S= J(f). LI 


The Riesz Weak Compactness Theorem Let (X, M, w) be a o-finite measure space and 
1 < p<ov. Then every bounded sequence in L?( X, 1) has a weakly convergent subsequence; 
that is, if { f,} is a bounded sequence L?(X , -), then there is a subsequence { f,,} of {f,} and 
a function f in L?(X, w) for which 


Jim | In-gdu= | f - gdp forall g € L4(X, w), where 1/p+1/q =1. 


Proof The preceding theorem asserts that L?(X, w) is reflexive. However, according to 
Theorem 17 of Chapter 14, every bounded sequence in a reflexive Banach space has a 
weakly convergent subsequence. The conclusion now follows from the Riesz Representation 
Theorem for the dual of L?(X, yw). L 


In Chapter 8, we studied weak convergence in L?(E, m), where E is a Lebesgue 
measurable set of real numbers and m is Lebesgue measure. In Chapter 14, we studied 
properties of weakly convergent sequences in a general Banach spaces and these, of course, 
hold for weak convergence in L?(X, «). We record here, without proof, three general results 
about weak convergence in L?(X, w), for 1 < p < oo and (X, M, pw) a general o-finite 
measure space. The proofs are the same as in the case of Lebesgue measure on the real line. 


The Radon-Riesz Theorem Let (X, M, w) be a o-finite measure space, 1 < p < 00, and 
{fn} 4 sequence in L?(X, w) that converges weakly in L?(X, w) to f. Then 


{fn} converges strongly in L?(X, pw) to f 


if and only if 
lim {fully = IIflp- 


Corollary 9 Let (X, M, w) be a o-finite measure space, 1 < p < 00 and { fn} a sequence in 
L?(X, w) that converges weakly in L?(X, w) to f. Then a subsequence of { f,} converges 
strongly in L?(X, p) to f if and only if 


Il fllp = lim inf || fallp. 


The Banach-Saks Theorem Let (X, M, w) be ao-finite measure space, 1 < p< oo, and { fy} 
a sequence in L?( X, w) that converges weakly in L?(X, w) to f. Then there is a subsequence 
{ fn, } for which the sequence of Cesdro means converges strongly in L?(X, w) to f, that is, 


lim Sng + fing too + Sry 


k— 00 k 


= f strongly in L?(X, pw). (22) 
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PROBLEMS 


13. A linear functional S$: L?(X, «) > R is said to be positive provided S(g) > 0 for each 
nonnegative function g in L?(X, w). For 1 < p < oo and w o-finite, show that each bounded 
linear functional on L?(X, w) is the difference of bounded positive linear functionals. 

14. Prove the Radon-Riesz Theorem, and the Banach-Saks Theorem in the case p = 2. 


15. Let X be the subspace of L®(R, m), where m is Lebesgue measure, consisting of the 
continuous functions f that have a finite limit as x + oo. For f € X, define S(f) = 
limo f(x). Use the Hahn-Banach Theorem to extend S to a bounded linear functional on 
L®(R, m). Show that there is not a function f in L}([a, b], m) such that 


S(g) =| f - gdm forall ge L™(R, m). 
R 


Does this contradict the Riesz Representation Theorem? 


16. Let wu be the counting measure on the set of natural numbers N. 
(i) For1< p<, show that L?(N, «) =/? and thereby characterize the dual space of £? 
for 1 < p< oo. 


(11) Discuss the dual of L?(X, 1) for 1 < p < 00, where pis the counting measure on a not 
necessarily countable set X. 


17. Find a measure space (X, M, yw.) with the property that all the theorems of this section hold 
in the case p = 1. 


18. Show that for Lebesgue measure on a closed, bounded interval [a, b] of real numbers and 
p = 1, neither the Riesz Weak Compactness Theorem, nor the Radon-Riesz Theorem, nor 
the Banach-Saks Theorem are true. 


19.5 WEAK SEQUENTIAL COMPACTNESS IN L'(X, 4): 
THE DUNFORD-PETTIS THEOREM 


For a measure space (X, M, w), in general, the Banach space L'(X, w) is not reflexive, 
in which case, according to the Eberlein-Smulian Theorem, there are bounded sequences 
in L'(X, «) that fail to have weakly convergent subsequences. It therefore is important 
to identify sufficient conditions for a bounded sequence in L!(X, w) to possess a weakly 
convergent subsequence. In this section we prove the Dunford-Pettis Theorem, which tells 
us that, for 4(X) < oo, if a bounded sequence in L'(X, w) is uniformly integrable, then it 
has a weakly convergent subsequence. Recall that a sequence { f,} in L'(X, w) is said to be 
uniformly integrable provided for each € > 0, there is a 5 > 0 such that for any measurable 
set E, 


if u(E) <6, then | l fal dw <e for all n. 
E 
For finite measure spaces, we have the following characterization of uniform integrability. 


Proposition 10 For a finite measure space (X, M, w) and bounded sequence {fn} in 
L!(X, w), the following two properties are equivalent: 


(i) {fn} is uniformly integrable over X. 
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(ii) For each € > 0, there is an M > 0 such that 


| | fn| < € for all n. (23) 
{xeX || fn(x)|>M} 


Proof Since { f,,} is bounded, we may choose C > 0 such that || f,||1 < C for all n. First assume 
(i). Let € > 0. Choose 5 > 0 such that if E is measurable and (£) < 4, then, [| faldu <e 
for all n. By Chebychev’s Inequality, 


m{xEX| |fr(x)|=M}< ay | Mildu <5 forall 


Hence if M > C/6, then {x € X || fn(x)| = M) <6 and therefore (23) holds. Now assume 
that (ii) holds. Let « > 0. Choose M > 0 such that 


| l fal <€/2 for all n. 
(xX || fn(x)I>M} 


Define 5 = ¢/2M. Then by the choice of M and 6, for any measurable set E, if w( £) <6 and 
n is any natural number, then 


[iidw= | Jaldu+ [ \fuldu <¢€/2+M-p(E) <e. 
E (xeE || fn(x)|=M) {x€E || fn(x)|<M} 


Therefore { f,,} is uniformly integrable over X. L] 


For an extended real-valued measurable function f on X and a> 0, define the 
truncation at level a of f, f le] on X by 


0 if f(x) >a 
fil(x)=d f(x) if -a< f(x) <a 
0 if f(x) <-a. 


Observe that if 4(X) < oo, then for f € L'(X, 4) anda > 0, fl! belongs to L!(X, w) and 
has the following approximation property: 


ik — f'Ndu 


< | Fld. (24) 
{xEX || f(x)|>a} 


Lemma 11 For a finite measure space (X, M, w) and bounded uniformly integrable sequence 
{fa} in L'(X, w), there is a subsequence { fn,} such that for each measurable subset E of X, 


| | Sy | is Cauchy. (25) 
E 


Proof We first describe the centerpiece of the proof. If {g,} is any bounded sequence in 
L'(X , w) and a > 0, then, since 4(X) < oo, the truncated sequence folly is bounded in 
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L?(X, w). The Riesz Weak Compactness Theorem tells us that there is a subsequence {gh} 
that converges weakly in L7(X, w). Since u(X) <oo, integration over a fixed measurable set 
is a bounded linear functional on L*( X, w) and therefore for each measurable subset E of X, 
{ te git! dus is Cauchy. The full proof uses this observation together with a diagonalization 
argument. 

Indeed, let a = 1. There is a subsequence of { f,,} for which the truncation at level 1 
converges weakly in L*(X, w). We can then take a subsequence of the first subsequence 
for which the truncation at level 2 converges weakly in L*(X, ). We continue inductively 
to find a sequence of sequences, each of which is a subsequence of its predecessor and the 
truncation at level k of the kth subsequence converges weakly in L?(X, w). Denote the 
diagonal sequence by {h,}. Then {h,} is a subsequence of { f,,} and for each natural number 
k and measurable subset E of X, 


| nik] | is Cauchy. (26) 
E 


Let E be a measurable set. We claim that 


| hy a is Cauchy. (27) 
E 


Let € > 0. Observe that for natural numbers k, n, and m, 


hn — him =| lel — all] EE — tm + [fn — Alt), 


< 


Therefore, by (24), 
| | [hn — hm] du | nt! nl a + | m| dp + | in| dt. 
E E (x€E | |ltm|(x)>K) (x€E | |htn|(x)>k} 
(28) 


We infer from the uniform integrability of {f,} and Proposition 10 that we can choose a 
natural number kp such that 


| |hn| dp < €/3 for all n. (29) 
{xEE | |hn|(x)>ko} 
On the other hand, by (26) at k = kg, there is an index N such that 
| i [nlkol _ Ae dl <€/3 for alln,m > N. (30) 
E 


We infer from (28), (29), and (30) that 


[Ur ~ hd <é¢€foralln,m>N. 
E 


Therefore (27) holds and the proof is complete. LI 
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Theorem 12 (the Dunford-Pettis Theorem) For a finite measure space (X, M, w) and 
bounded sequence { fn} in L'(X, w), the following two properties are equivalent: 


(i) {fn} is uniformly integrable over X. 


(ii) Every subsequence of {f,} has a further subsequence that converges weakly in 
L'(X, p). | 


Proof First assume (i). It suffices to show that {f,} has a subsequence that converges 
weakly in L'(X, ). Without loss of generality, by considering positive and negative parts, 
we assume that each f, is nonnegative. According to the preceding lemma, there is a 
subsequence of { f,}, which we denote by {h,}, such that for each measurable subset E of X, 


| hy a is Cauchy. (31) 
E 


For each n, define the set function v, on M by 
v,(E) = ] h, dp for all E € M. 
E 


Then, by the countable additivity over domains of integration, v, is a measure and it is 
absolutely continuous with respect to w. Moreover, for each E € M, {v,(E)} is Cauchy. The 
real numbers are complete and hence we may define a real-valued set function v on M by 


lim vn(E) = v(E) for all E € M. 


Since {h,} is bounded in L'(X, w), the sequence {v,(X)} is bounded. Therefore, the Vitali- 
Hahn-Saks Theorem tells us that v is a measure on (X, M) that is absolutely continuous with 
respect to 4. According to the Radon-Nikodym Theorem, there is a function f € L'(X, pw) 
for which 


y(E) = | f du for all Ee M. 
E 
Since 
lim i fn dp =| f dp for all E e M, 
noo E E 
lim fr-edp= | f - gdp for every simple function ¢. (32) 
n> X X 


By assumption, {f,} is bounded in L!(X, w). Furthermore, by the Simple Approximation 
Lemma, the simple functions are dense in L™(X, ). Hence 


tim, fuedu= | f- gdp forall ge L(x, bt), (33) 
n— X X 


that is, { f,} converges weakly in L'(X, p) to f. 
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It remains to show that (11) implies (i). We argue by contradiction. Suppose { f,,} satisfies 
(ii) but fails to be uniformly integrable. Then there is an € > 0, a subsequence {h,} of {fn}, 
and a sequence {E,,} of measurable sets for which 


lim Mn( En) = 0 but | hy, du > €o for all n. (34) 


n 


By assumption (ii) we may assume that {h,} converges weakly in L'(X, w) to h. For each n, 
define the measure v, on M by 


v,(E) = | h, dp for all E € M. 
E 


Then each vy, is absolutely continuous with respect to w and the weak convergence in 
L'(X, w) of {hy} to h implies that 


{v,(E)} is Cauchy for all E € M. 


But the Vitali-Hahn—Saks Theorem tells us that {v,,(£)} is uniformly absolutely continuous 


with respect to yz and this contradicts (34). Therefore (ii) implies (i) and the proof is complete. 
LI 


Corollary 13 Let (X, M, w) be a finite measure space and { f,} a sequence in L'(X, w) that 
is dominated by the function g € L'(X, p) in the sense that 


lfnl < ga@e.on E forall n. 


Then { fn} has a subsequence that converges weakly in L'(X, yw). 


Proof The sequence {f,,} is bounded in L'(X, «) and uniformly integrable. Apply the 
Dunford-Pettis Theorem. LJ 


Corollary 14 Let (X, M, uw) be a finite measure space, 1 < p < ov, and { fn} a bounded 
sequence in L?( X, w). Then { f,} has a subsequence that converges weakly in L'(X, p). 


Proof Since y(X) < oo, we infer from Hélder’s Inequality that { f,} is a bounded sequence 
in L'(X, w) and is uniformly integrable. Apply the Dunford-Pettis Theorem. L] 


PROBLEMS 


19. For a natural number n, let e,, be the sequence whose nth term is 1 and other terms are zero. 
For what values of p, 1 < p < oo, does {e,} converge weakly in £7? 


20. Find a bounded sequence in L!([a, b], m), where m is Lebesgue measure, which fails to have 
a weakly convergent subsequence. 


21. Find a measure space (X, M, uw) for which every bounded sequence in L!(X, w) has a 
weakly convergent subsequence. 


22. Fill in the details of the proof of Corollary 14. 


23. Why is the Dunford-Pettis Theorem false if the assumption that the sequence is bounded in 
L! is dropped? 
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In Chapter 17 we considered the Carathéodory construction of measure. In this chapter 
we first use the Carathéodory-Hahn Theorem to construct product measures and prove the 
classic theorems of Fubini and Tonelli. We then use this theorem to construct Lebesgue 
measure on Euclidean space R” and show that this is a product measure and therefore 
iterated integration is justified. We conclude by briefly considering a few other selected 
measures. 


20.1 PRODUCT MEASURES: THE THEOREMS OF FUBINI AND TONELLI 


Throughout this section (X, A, w) and (Y, B, v) are two reference measure spaces. Consider 
the Cartesian product X x Y of X and Y. If AC X and BC Y, we call A X B a rectangle. If 
A € Aand B € B, we call A X B a measurable rectangle. 


Lemmal Let (A;  B,}?°, be a countable disjoint collection of measurable rectangles whose 
union also is a measurable rectangle A X B. Then 
. 0O 
mA) Xv(B) = Sp Ax) Xv( Be). 
k=1 


Proof Fix a point x € A. For each y € B, the point (x, y) belongs to exactly one Ax X By. 
Therefore we have the following disjoint union: 


B= J Bx. 
{k | xe Ax} 


By the countable additivity of the measure pv, 


v(B)= Sd v(B). 


{k | xEAx} 
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Rewrite this equality in terms of characteristic functions as follows: 


v(B)-xa(x) = s v( By) - Xa,(x) for all x € A. 
k=1 


Since each A; is contained in A, this equality also clearly holds for x € X \ A. Therefore 


OO 
v(B)- xa = dv Be) - XA, on X. 
k=1 


By the Monotone Convergence Theorem, 


w(A) Xv(B) = [92) r0du= > | 80 Xa,du = > m( An) x0( Bu) - 


Proposition 2 Let R be the collection of measurable rectangles in X XY and for a measurable 
rectangle A X B, define 


MAX B) = p(A)-v(B). 


Then R is a semiring and X: R —> [0, oo] is a premeasure. 


Proof It is clear that the intersection of two measurable rectangles is a measurable rectangle. 
The relative complement of two measurable rectangles is the disjoint union of two measurable 
rectangles. Indeed, let A and B be measurable subsets of X and C and D be measurable 
subsets of Y. Observe that 


(AXC) ~ (BX D) =|(A~B) XC] U[(ANB)x(C~D)I, 


and the right-hand union is the disjoint union of two measurable rectangles. 


It remains to show that A is a premeasure. The finite additivity of A follows from the 
preceding lemma. It is also clear that A is monotone. To establish the countable monotonicity 
of A, let the measurable rectangle E be covered by the collection {E;}7°, of measurable 
rectangles. Since R is a semiring, without loss of generality, we may assume that {E;}7°, isa 
disjoint collection of measurable rectangles. Therefore 


oO 
E=|(JENk, 
k=1 


this union is disjoint and each EN EF; is a measurable rectangle. We infer from the preceding 
lemma and the monotonicity of A that 


oO oO 
ME) = DS) MEN Ex) < > A(Ex). 
k=1 k=1 
Therefore A is countably monotone. The proof is complete. L 


This proposition allows us to invoke the Carathéodory -Hahn Theorem in order to 
make the following definition of product measure, which assigns the natural measure, 
p(A) -v(B), to the Cartesian product A X B of measurable sets. 
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Definition Let (X,A, uw) and (Y, B,v) be measure spaces, R the collection of measurable 
rectangles contained in X X Y, and \ the premeasure defined on R by 


A(AX B) = p(A)-v(B) for AX Be R. 


By the product measure \ = y. X v we mean the Carathéodory extension of A: R — [0, oo] 
defined on the o-algebra of (1 Xv)*-measurable subsets of X XY. 


Let E be a subset of X x Y and f a function on E. For a point x € X, we call the set 
E,={yeY| (x,y)e E}CY 


the x-section of E and the function f(x, -) defined on E, by f(x, -)(y) = f(x, y) the 
x-section of f. Our goal now is to determine what is necessary in order that the integral of 
f over X XY with respect to wu x v be equal to the integral over X with respect to yw of the 
function on X that assigns to x € X the integral of f(x,-) over Y with respect to v. This is 
called iterated integration. The following is the first of two fundamental results regarding 
iterated integration. ! 


Fubini’s Theorem Let (X, A, w) and (Y, B, v) be two measure spaces and v be complete. 
Let f be integrable over X XY with respect to the product measure w Xv. Then for almost all 
x € X, the x-section of f, f(x, -), is integrable over Y with respect to v and 


[ _ fd(uxv) = [ | f(x, y) do(y) 


An integrable function vanishes outside a o-finite set. Therefore, by the Simple 
Approximation Theorem and the Monotone Convergence Theorem, the integral of a 
general nonnegative integrable function may be arbitrarily closely approximated by the 
mtegral of a nonnegative simple function that vanishes outside a set of finite measure, that is, 
by a linear combination of characteristic functions of sets of finite measure. Thus the natural 
initial step in the proof of Fubini’s Theorem is to prove it for the characteristic function of a 
measurable subset EF of X X Y that has finite measure. Observe that for such a set, if we let f 
be the characteristic function of EF, then 


dp(x). (1) 


| f d(uXv) = (ux)(E). 
XXY 


On the other hand, for each x € X, f(x,-) = Xz, and therefore if the x-section of E, E,, is 
v-measurable, then 


[ Fo ») a) =o), 
Y 
Therefore, for f = yz, (1) reduces to the following: 
(wxv)(B) = f (Ex) dux). 
X 
1Let Xq be a measurable subset of X for which p(X~Xq) = 0. For a measurable function h on Xq, we write 


f yi dp to denote [ Xo h dy, if the latter integral is defined. This convention is justified by the equality of [ yhdp 
and Xo h dp for every measurable extension of h to X. 
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Proposition 10 of Chapter 17 tells us that a measurable set E C X X Y is contained in 
an Rs set A for which (u Xv)(A~E) = 0. We therefore establish the above equality first 
for Rs sets and then for sets with product measure zero. 


Lemma 3 Let EC X XY be an Rgs Set for which (pp Xv)(E) < oo. Then for all x in X, 
the x-section of E, E,, is a v-measurable subset of Y, the function x» v(E,) forx € X isa 
p-measurable function and 


(uxv)(E) = [ v( Ex) d(x). (2) 


Proof First consider the case that E = A X B, a measurable rectangle. Then, for x € X, 


zr.) B forxe A 
*) @ forx€ A, 


and therefore v( E,) = v( B) - x(x). Thus 


(uxv)(B) = u(A)-9(B)=0(B)- | xadu= f »( Bs) dul) 


We next show (2) holds if E is an Rg set. Since R is a semiring, there is a disjoint collection 
of measurable rectangles {A; X By}, whose union is E. Fix x € X. Observe that 


Thus E,, is y-measurable since it is the countable union of B;,’s, and since this union is disjoint, 
by the countable additivity of v, 


oO 


v(Ex) = > v( (Ag X B)x). 


k=1 


Therefore, by the Monotone Convergence Theorem, the validity of (5) for each measurable 
rectangle A; X B, and the countable additivity of the measure p X v, 


[ ee) aula) => f (Ax Bes du 
x k=1/X 


0° 


= > #(Ak) Xv( By) 


k=1 


= (wxv)(E). 


Thus (2) holds if E is an Rg set. Finally, we consider the case that E is in Rag and use the 
assumption that E has finite measure. Since R is a semiring, there is a descending sequence 
{Ex}V2., of sets in Rg whose intersection is E. By the definition of the measure y. Xv in terms 
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of the outer measure induced by the premeasure yx Xv on R, since (uu Xv)(E) < 00, we may 
suppose that (4 Xv)( E,) < oo. By the continuity of the measure p Xv, 


fim (ue Xv)(Ex) = (wxv)(E). (3) 
Since EF; is an R, set, 
(wxv)(Er)= | ((Er)s) dul), 
X 
and hence, since (uw Xv)( E1) < oo, 
v((E1),) < oo for almost all x € X. (4) 


Now for each x € X, E, is v-measurable since it is the intersection of the descending sequence 
of v-measurable sets {(E;),}?2, and furthermore, by the continuity of the measure v and 
(4), for almost all x € X, 


Jim v((Ex)x) = (Ey). 


Furthermore, the function x» v((£;),) is a nonnegative integrable function that, for each 
k, dominates almost everywhere the function x > v((E;),). Therefore by the Lebesgue 
Dominated Convergence Theorem, the validity of (5) for each R, set E;, and the continuity 


property (3), 


[ v(ee)dulx) = tim f (He): du 
JX oo Jy 


= jim (u Xv)( Ex) 


= (uXv)(E). 
The proof is complete. U 


Lemma 4 Assume the measure v is complete. Let EC X XY be measurable with respect to 
pexv. If (wXv)(E) =0, then for almost all x € X, the x-section of E, E,, is v-measurable and 
v(E,.) = 0. Therefore 


(wxv)(B) = | v(x) du( 
x 
Proof Since (uw Xv)(E) < oo, it follows from Proposition 10 of Chapter 17 that there is a 


set A in Ros for which E C A and (uw Xv)(A) = 0. According to the preceding lemma, for 
all x € X, the x-section of A, A,, is y-measurable and 


(uxv)(A) = [ W(Ax) d(x). 


Thus v( A, ) = 0 for almost all x € X. However, for all x € X, E, C A,. Therefore we may 
infer from the completeness of v that for almost all x € X, E, is v-measurable and v( FE, ) = 0. 
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Proposition 5 Assume the measure v is complete. Let EC X XY be measurable with respect to 
pXv and (wXv)( E) <oo. Then for almost all x in X, the x-section of E, E,, is av-measurable 
subset of Y, the function x->v( E,,) for x € X is a w-measurable function, and 


(wxv)(B) = f (Ex) du), (5) 


X 


Proof Since (uXv)(E) < oo, it follows from Proposition 10 of Chapter 17 that there is a set 
Ain Rgs for which E C A and (uw Xv)( E~A) = 0. By the excision property of the measure 
Xv, (wXv)(E) = (wXv)(A). By the preceding lemma, 

v(A,) =v(E,) + v((A~E),) = v( E,) for almost all x € X. 
Once more using the preceding lemma, we conclude that 


(wXv)(E) =(uXxv)(A) 


The proof is complete. LJ 


Theorem 6 Assume the measure v is complete. Let p: X X Y — R be a simple function that 
is integrable over X XY with respect to 4 Xv. Then for almost all x € X, the x-section of ¢, 
g(x, -), is integrable over Y with respect to v and 


[ eauxv) = ff [os y) dv(y) 


Proof The preceding proposition tells us that (6) holds if @ is the characteristic function 
of a measurable subset of X x Y of finite measure. Since ¢ is simple and integrable, it is a 
linear combination of characteristic functions of such sets. Therefore (6) follows from the 
preceding proposition and the linearity of integration. 


du(x). (6) 


Proof of Fubini’s Theorem Since integration is linear, we assume that f is nonnegative. The 
Simple Approximation Theorem tells us that there is an increasing sequence {¢,} of simple 
functions that converges pointwise on X x Y to f and, for each k, 0 < g < fon X XY. Since 
f is integrable over X X Y, each gy, is integrable over X X Y. According to the preceding 
proposition, for each k, 


Moreover, by the Monotone Convergence Theorem, 


du(x). 


[ x. y) dv(y) 
y 


fd(uXv) = lm pe d(mXv). 
XxY k->00 J yxy 
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It remains to prove that 


du(x). (7) 


k-00 


lim | fas y) dv(y) 


d(x )= [| [1 naw 


If we excise from X X Y a set of 4 X v-measure zero, then the right-hand side of (7) 
remains unchanged and, by Lemma 4, so does the left-hand side. Therefore, by possibly 
excising from X X Y a set of uw X v-measure zero, we may suppose that for all x € X and all k, 
x(x, -) is integrable over Y with respect to v. 

Fix x € X. Then {y;(x, -)} is an increasing sequence of simple v-measurable functions 
that converges pointwise on Y to f(x, -). Thus f(x, -) is v-measurable and, by the Monotone 
Convergence Theorem, 


[fs »)40() = fim [ ox(s, ») a) 8) 


For each x € X, define h(x) = fy f(x, y)dv(y) and hg(x) = fy ge(x, y) dv(y). According 
to the preceding theovem an h,: X — R is integrable over X with respect to pw. Since {hy} 
is an increasing sequence of nonnegative measurable functions that converges pointwise on 
X toh, by the Monotone Convergence Theorem, 


k—>00 


lim ; [ous y) dv(y) 


dp(x)= lim fredu= [rdu= [| r.rrdecr}oncn 


Therefore (7) is verified. The proof is complete. O 


In order to apply Fubini’s Theorem, one must first verify that f is integrable with 
respect to yz x v; that is, one must show that f is a measurable function on X X Y and that 
J \fld(wxXv) < oo. The measurability of f on X x Y is sometimes difficult to establish, but 
in many cases we can establish it by topological considerations (see Problem 9). In general, 
from the existence and finiteness of the iterated integral on the right-hand side of (1), we 
cannot infer that f in integrable over X XY (see Problem 6). However, we may infer from the 
following theorem that if v is complete, the measures pz and v are o-finite and f is nonnegative 
and measurable with respect to uw X v, then the finiteness of the iterated integral on the 
right-hand side of (1) implies that f is integrable over X x Y and the equality (1) does hold. 


Tonelli’s Theorem Let (X,A,) and (Y,B,v) be two o-finite measure spaces and v be 
complete. Let f be a nonnegative (ys X v)-measurable function X X Y. Then for almost all 
x € X, the x-section of f, f(x, -), is v-measurable and the function defined almost everywhere 
on X by x +> the integral of f(x, -) over Y with respect to v is z-measurable. Moreover, 


[,, samen [| [ 1 y) dv(y) 


Proof ‘The Simple Approximation Theorem tells us that there is an increasing sequence {;} 
of simple functions that converges pointwise on X X Y to f and, for each k, 0 < g < f on 
X XY. At this point in the proof of Fubini’s Theorem, we invoked the integrability of the 


d(x). (9) 
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nonnegative function | f| to conclude that since each 0 < g < | f| on X, each g, 1s integrable 
and hence we were able to apply Theorem 6 for each g;. Here we observe that the product 
measure p Xv is o-finite since both yp and v are o-finite. Therefore we may invoke assertion 
(i) of the Simple Approximation Theorem in order to choose the sequence to {p,} to have 
the additional property that each og, vanishes outside of a set of finite measure and therefore 
is integrable. The proof from this point on is exactly the same as that of Fubini’s Theorem. L| 


Two comments regarding Tonelli’s Theorem are in order. First, each of the integrals 
in (9) may be infinite. If one of them is finite, so is the other. Second, if 4 is complete, then 
the right-hand integral in (9) may be replaced by an iterated integral in the reverse order. 
Indeed, we have considered iterated integration by integrating first with respect to y and 
then with respect to x. Of course, all the results hold if one integrates in the reverse order, 
provided in each place we required the completeness of y we now require completeness of p. 


Corollary 7 (Tonelli) Let (X, A, 1) and (Y, B,v) be two o-finite, complete measure spaces 
and f a nonnegative (4 Xv)-measurable function on X XY. Then (i) for almost all x € X, 
the x-section of f, f(x, -), is v-measurable and the function defined almost everywhere on X 
by x +> the integral of f(x, -) over Y with respect to v is u-measurable and (ii) for almost all 
y €Y, the y-section of f, f(-, y) is w-measurable and the function defined almost everywhere 
on Y by y+> the integral of f(-, y) over X with respect to p is v-measurable. If 


[ | I f(x, y) doy) 


then f is integrable over X XY with respect to 4 Xv and 


[fv y) du(x) 


Proof Tonelli’s Theorem tells us that f is integrable over X X Y with respect to w Xv and 
we have the right-hand equality in (11). Therefore f is integrable over X X Y with respect to 
pu. Xv. We now apply Fubini’s Theorem to verify the left-hand equality in (11). a 


du(x) <0, (10) 


)| av v= | fd(uxv) =| [fa y) dv(y ant ). (4) 


The examples in the problems show that we cannot omit the hypothesis of the 
integrability of f from Fubini’s Theorem and cannot omit either o-finiteness or nonnegativity 
from Tonelli’s Theorem (see Problems 5 and 6). In Problem 5 we exhibit a bounded function 
f on the product X x Y of finite measure spaces for which 


[ d(x yf [sos anes 


even though each of these iterated integrals is properly defined. 

We conclude this section with some comments regarding a different approach to the 
development of a product measure. Given two measure spaces (X,A,) and (Y,B,v), 
the smallest o-algebra of subsets of X X Y containing the measurable rectangles is denoted 
by A xB. Thus the product measure is defined on a o-algebra containing A x B. These 


J #0 ») an(9) | ant dv(y) 
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two measures are related by Proposition 10, of Chapter 17, which tells us that the p X v- 
measurable sets that have finite 4 x v-measure are those that differ from sets in A x B by 
sets of 4 X v-measure zero. Many authors prefer to define the product measure to be the 
restriction of 4. Xv to A X B. The advantage of our definition of the product measure is that 
this does what we want for Lebesgue measure: As we will see in the next section, the product 
of m-dimensional Lebesgue measure with k-dimensional Lebesgue measure is (m + k)- 
dimensional Lebesgue measure. Since our hypotheses for the Fubini and Tonelli Theorems 
require only measurability with respect to the product measure, they are weaker than 
requiring measurability with respect to A x B. Moreover, a function that is integrable with 
respect to A x B is also integrable with respect to the product measure that we have defined. 

The product measure is induced by an outer measure and therefore is complete. But 
we needed to assume that v is complete in order to show that if E C X X Y is measurable 
with respect to our product measure, then almost all the x-sections of E are v-measurable. If, 
however, E is measurable with respect to AB, then all of the x-sections on E belong to A even 
if is not complete. This follows from the observation that the collection of subsets of XXY that 
have all of their x-sections belonging to B is a c-algebra containing the measurable rectangles. 


PROBLEMS 
1. Let AC X and let B be a v-measurable subset of Y. If A X B is measurable with respect to the 
product measure p Xv, is A necessarily measurable with respect to? 


2. Let N be the set of natural numbers, M = 2N, and c the counting measure defined by setting 
c( E) equal to the number of points in E if E is finite and oo if E is an infinite set. Prove that 
every function f: N > Ris measurable with respect toc and that f is integrable over N with 


ie.) 
respect to c if and only if the series } f(k) is absolutely convergent in which case 


k=1 
[ fac= > Hb). 
N k=1 


3. Let (X, A, uw) = (Y, B, v) = (N, M,c), the measure space defined in the preceding 
problem. State the Fubini and Tonelli Theorems explicitly for this case. 
4. Let (N, M, c) be the measure space defined in Problem 2 and (X, A, ) a general measure 
space. Consider N X X with the product measure c X y. 
(i) Show that a subset E of NX X is measurable with respect to c X p if and only if for each 
natural number k, Ex = {x € X|(k, x) € E} is measurable with respect to p. 


(ii) Show that a function f: NX X — R is measurable with respect to c X w if and only if for 
each natural number k, f(k, -): X + Ris measurable with respect to p. 


(iti) Show that a function f: Nx X —> R is integrable over N x X with respect to c X py if and 
only if for each natural number k, f(k, -): X > R is integrable over X with respect to 
and 


> y | f(k, x)| du(x) < oo. 


k=1 


(iv) Show that if the function f: NX X — R is integrable over N X X with respect to c X p, 
then 


[ fatexny=S f f(&, x)au(x) <0 
NXX k=1 YX 


10. 


11. 


12. 
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. Let (X, A, w) = (Y, B, v) =(N, M, c), the measure space defined in Problem 2. Define 


f: NXN — R by setting 
2-2-* ifx=y 
f(x, y)=4 -242~%* ifx=y+1 
0 otherwise. 


Show that f is measurable with respect to the product measure c xc. Also show that 


[| fre m acon acm [| fem md ein 


Is this a contradiction either of Fubini’s Theorem or Tonelli’s Theorem? 


dc(m). 


. Let X = Y be the interval [0,1], with A = B the class of Borel sets. Let 4 be Lebesgue 


measure and v = c the counting measure. Show that the diagonal A = {(x, y)|x = y} is 
measurable with respect to the product measure wu Xc (is an Rg, in fact). Show that if f is 
the characteristic function of D, 


I, 1)x(0, 1] faluxe)# I, 1 ff, 1 f(x y) tly) d(x). 


Is this a contradiction either of Fubini’s Theorem or Tonelli’s Theorem? 


. Prove that the conclusion of Tonelli’s Theorem is true if one of the spaces is the space 


(N, M, c) defined in Problem 2 and the other space is a general measure space that need not 
be o-finite. 


. In the proof of Fubini’s Theorem justify the excision from X XY of a set of wv measure zero. 


. Let X = Y = [0, 1], and let u = v be Lebesgue measure. Show that each open set in X X Y is 


measurable, and hence each Borel set in X X Y is measurable. Is every continuous real-valued 
function on [0, 1] x[0, 1] measurable with respect to the product measure? 


Let h and g be integrable functions on X and Y, and define f(x, y) = h(x)g(y). Show that f 
is integrable on X X Y with respect to the product measure, then 


| fa(uxr)= | hdwf gar 
XXY xX Y 


(Note: We do not need to assume that y and v are o-finite.) 


Show that Tonelli’s Theorem is still true if, instead of assuming pu and v to be o-finite, we 
merely assume that {(x, y)| f(x, y) #0} is a set of o-finite measure. 


For two measure spaces (X, A, uw) and (Y,B,v) we have defined A X B to be the smallest 
o-algebra that contains the measurable rectangles. 


(i) Show that if both measures are o-finite, then wXv is the only measure on AXB that assigns 
the value 4(A) -»(B) to each measurable rectangle A X B. Also that this uniqueness 
property may fail if we do not have o-finiteness. 


(11) Show that if E ¢ AXB, then E, € B for each x. 


(iii) Show that if f is measurable with respect to A Xx B, then f(x, -) is measurable with 
respect to B for each x. 
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13. If {( Xx, Ag, wx) }{_, 18 a finite collection of measure spaces, we can form the product measure 
J X--»X py On the space X, X--- x X, by starting with the semiring of rectangles of 
the form R = A; X---X A,, define u(R) = TI wx( Ax), Show that u is a premeasure and 
define the product measure to be the Carathéodory extension of 4. Show that if we identify 
(X1X---XXp)X(X p41 X---XXp) with (X1X---XX,), then (jy X---Xwp)X (Mpg 1X: Xn) = 
By X+++X pn. 

14, A measure space (X, M, w) such that (X) = 1 is called a probability measure space. Let 
{(X), Ay, Ma) }aea be a collection of probability measure spaces parametrized by the set A. 
Show that we can define a probability measure 


w= [] wa 


AeA 


on a suitable o-algebra on the Cartesian product TJ ,< X, so that 


w(A) = [TT wal An) 


AEA 


when A = JJ yea Aa. (Note that (A) can only be nonzero if all but a countable number of 
the A, have u(A,) = 1.) 


20.2 LEBESGUE MEASURE ON EUCLIDEAN SPACE R” 


For a natural number n, by R” we denote the collection of ordered n-tuples of real numbers 
X = (x1,...,%,). Then R” is a linear space and there is a bilinear form (-, -): R’ XR" > R 
defined by 


(x, y)= > Xx: ye for all x, y ER". 
k=1 


This bilinear form is called the inner product or the scalar product. It induces a norm || - | 


defined by 
xl = J(x, x) = D> x? for allx eR". 
k=1 


This norm is called the Euclidean norm. It induces a metric and thereby a topology on R”. 
The linear space R”, considered with this inner product and induced metric and topology, is 
called n-dimensional Euclidean space. 

By a bounded interval in R we mean a set of the form [a, b], [a, b),(a, b] or (a, b) for 
real numbers a < b. So here we are considering the empty-set and the set consisting of a 
single point to be a bounded interval. For a bounded interval J with end-points a and b, we 
define its length £(/) to be b — a. 


Definition By a bounded interval in R” we mean a set I that is the Cartesian product of n 
bounded intervals on real numbers, 


l=)Xhx::--XIh,. 
We define the volume of I, vol(1), by 


vol(1) = €(,) -€(a) ++ €(In). 
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Definition We call a point in R" an integral point provided each of its coordinates is an 
integer and for a bounded interval I in R", we define its integral count, p!""8"“'( I), to be the 
number of integral points in I. 


Lemma 8 For each « > 0, define the ¢-dilation T-: R" > R" by T-(x) = €- x. Then for each 
bounded interval I in R", 


him pintegral ( T.( I) ) 


€—>00 ef 


= vol(I). (12) 
Proof For a bounded interval J in R with end-points a and b, we have the following estimate 
for the integral count of I (see Problem 18): 
(b—a)-1<pi™8'(T) < (b—a) +1. (13) 
Therefore for the interval J = I; X h X---XI,, since 
integral 7) — integral p) .., yintegraly 7) 

if each J, has end-points a, and b;,, we have the estimate: 

[(b1 — a1) —1]-++-[(bn — an) —1] < pi!" (1) < [(b) — a1) +1] «+= [(bn —n) +1]. (14) 
For € > 0 we replace the interval J by the dilated interval T.(J) and obtain the estimate 
[e-(bj —ay )—1]-- + - [€-(bn — an )—1] < pi?" (T.(1)) < [e-(b1 - a1) +1] += [€-(bn - an) +1]. 


(15) 
Divide this inequality by e” and take the limit as € > oo to obtain (12). LJ 


We leave the proof of the following proposition as an exercise in induction, using the 
property that the Cartesian product of two semirings is a semiring (see Problem 25). 


Proposition 9 The collection T of bounded intervals in R" is a semiring. 


Proposition 10 The set function volume, vol: I — [0, oo), is a premeasure on the semiring 
T of bounded intervals in R". 


Proof We first show that volume is finitely additive over finite disjoint unions of bounded 
intervals. Let J be a bounded interval in R” that is the union of the finite disjoint finite 
collection on bounded intervals {J* L-j: Lhen for each € > 0, the bounded interval T.( J) is 
the union of the finite disjoint collection of bounded intervals {T.(I* )}""_,. It is clear that the 
integral count y'"!¢8"! is finitely additive. Thus 


; m 
pintesralT.(1)) = >, vol(T.(1*)) for all € > 0. 

k=1 
Divide each side by e” and take the limit as € > oo to obtain, by (12), 


vol(I) = s vol(I*). 
k=1 
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Therefore the set function volume is finitely additive. 


It remains to establish the countable monotonicity of volume. Let J be a bounded 
interval in R” that is covered by the countable collection on bounded intervals {J* pane 
We first consider the case that I is a closed interval and each J‘ is open. By the Heine- 
Borel Theorem, we may choose a natural number m for which I is covered by the finite 
subcollection {I* /_,- It is clear that the integral count pu!” is monotone and finitely 
additive and therefore, since the collection of intervals is a semiring, finitely monotone. Thus 


pintesral (7) < s pintegral phy. 
k=1 


Dilate these intervals. Therefore 
. m . 
pintegral T.( 1)) < Ss pintesral Te( I‘)) for alle > 0. 

k=1 
Divide each side by e” and take the limit as € — oo to obtain, by (12), 

m CO 

vol(I) < ¥ vol(I*) < ¥ vol(J*). 
k=1 k=1 


It remains to consider the case of a collection {I* }r2., of general bounded intervals that cover 
the interval J. Let « > 0. Choose a closed interval / that is contained in J and a collection 
{I ss a of open intervals such that each J” C 7” and, moreover, 


vol(I) — vol(7) < and vol(7”) — vol(I”) < €/2™ for all m. 


By the case just considered, 


k=1 
Therefore 
OO 
vol(I) < § vol(I*) +2e. 
k=1 
Since this holds for all € > 0 it also holds for « = 0. Therefore the set function volume is a 
premeasure. L 


Definition The outer measure yx induced by the premeasure volume on the semiring 
of bounded intervals in R" is called Lebesgue outer measure on R". The collection of 
i7,-measurable sets is denoted by L” and called the Lebesgue measurable sets. The restriction 
of 7, to L” is called Lebesgue measure on R" or n-dimensional Lebesgue measure and 
denoted by pin. 


Theorem 11 The c-algebra L” of Lebesgue measurable subsets of R" contains the bounded 
intervals in R" and, indeed, the Borel subsets of R". Moreover, the measure space (R", CL", jn) 
is both o-finite and complete and for a bounded interval I in R", 


Hn (I) = vol(I). 
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Proof According to the preceding proposition, volume is a premeasure on the semiring of 
bounded intervals in R”. It clearly is o-finite. Therefore the Carathéodory -Hahn Theorem 
tells us that Lebesgue measure is an extension of volume and the measure space (R”, £”, py ) 
is both o-finite and complete. It remains to show that each Borel set is Lebesgue measurable. 
Since the collection of Borel sets is the smallest o-algebra containing the open sets, it suffices 
to show that each open subset O of R" is Lebesgue measurable. Let O be open in R”. The 
collection of points in © that have rational coordinates is a countable dense subset of O. 
Let {zx}7°, be an enumeration of this collection. For each k, consider the open cube” Tien 
centered at z, of edge-length 1/n. We leave it as an exercise to show that 


O= (J Tn (16) 
Ik,nCO 
and therefore since each J; , is measurable so is O, the countable union of these sets. L] 


Corollary 12 Let E be a Lebesgue measurable subset of R" and f: E — R be continuous. 
Then f is measurable with respect to n-dimensional Lebesgue measure. 


Proof Let © be an open set of real numbers. Then, by the continuity of f on E, f~'(O) = 
EU, where U is open in R”. According to the preceding theorem, U/ is measurable and 
hence so is f~!(O). O 


The Regularity of Lebesgue Measure The following theorem and its corollary strongly 
relate Lebesgue measure on R” to the topology on R” induced by the Euclidean norm. 


Theorem 13 Let E of a Lebesgue measurable subset of R". Then 
Hn (E) = inf {un(O) | ECO, O open} (17) 


and 
pn( E) = sup {pn(K)| K CE, K compact}. (18) 


Proof We consider the case in which E is bounded, and hence of finite Lebesgue measure, 
and leave the extension to unbounded E as an exercise. We first establish (17). Let € > 0. 
Since f,(E) = u7(E) < on, by the definition of Lebesgue outer measure, we may choose a 


countable collection of bounded intervals in R”, {J’"}”°_,, which covers E and 


SS pn") <pn( B) + 6/2. 


m=1 


For each m, choose an open interval that contains J” and has measure less than yu, (J) + 
e/[2™+1]. The union of this collection of open intervals is an open set that we denote by O. 
Then E C O and, by the countable monotonicity of measure, p,(O) < u,(£) +. Thus (17) 
is established. 


2By a cube in R” we mean an interval that is the Cartesian product of n intervals of equal length. 
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We now establish (18). Since E is bounded, we may choose a closed and bounded set 
K’ that contains E. Since K’~E is bounded, we infer from the first part of the proof that 
there is an open set O for which K’~E C O and, by the excision property of py, 


bn (O~[K'~E]) <e. (19) 


Define 
K = K'~0O. 


Then K is closed and bounded and therefore compact. From the inclusions K’~E C © and 
EC K’ we infer that 
K = K'~OCK'~[K’~E] = K'NECE 


and therefore K C E. On the other hand, from the inclusion E C K’ we infer that 
E~K = EX[K'~O] = ENO 


and 
ENOCO-N[K’~E]. 


Therefore, by the excision and monotonicity properties of measure and (19), 
Hn( E) — tn(K) = pn( E~K) < n(O~[K'~E]) <. 


Thus (18) is established and the proof is complete. O 


Each Borel subset of R” is Lebesgue measurable and hence so is any Gs or Fy 
set. Moreover, each set that has outer Lebesgue measure zero is Lebesgue measurable. 
Therefore the preceding theorem, together with the continuity and excision properties of 
measure, provides the following relatively simple characterization of Lebesgue measurable 
sets. It should be compared with Proposition 10 of Chapter 17. 


Corollary 14 For a subset E of R", the following assertions are equivalent: 


(i) E is measurable with respect to n-dimensional Lebesgue measure. 
(ti) There is a Gs subset G of R" such that 


EC Gand pp, (G~E) = 0. 
(tii) There is a F, subset F of R" such that 
FC Eand p)(E~F) = 0. 


We leave it as an exercise (see Problem 20) to infer from the above characterization 
of Lebesgue measurable sets that Lebesgue measure is translation invariant in the following 
sense: For E C R" and z € R", define the translation of E by z by 


E+z={x+z| xe E}. 
If E is z,-measurable, then so is E + z and 


Hn( E) = pn(E +2). 


Section 20.2 Lebesgue Measure on Euclidean Space R” 429 


Lebesgue Measure as a Product Measure For natural numbers n,m, and k such that 
n= m-+k, consider the sets R", R”, R“, and R” X R* and the mapping 


og: R" > R” xR‘ 
defined by 
p(X1,---,Xn) = ((41,---, Xm), (Xm41, +--+, Xm4k)) for all x € R”. (20) 


This mapping is one-to-one and onto. Each of the sets R”, R”, and R* has a linear structure, 
a topological structure, and a measure structure, and the product space R” X R* inherits a 
linear structure, a topological structure, and a measure structure from its component spaces 
R” and R‘. The mapping ¢ is an isomorphism with respect to the linear and topological 
structures. The following proposition tells us that the mapping ¢ is also an isomorphism from 
the viewpoint of measure. 


Proposition 15 For the mapping 9: R" > R™ X R‘ defined by (20), a subset E of R" is 
measurable with respect to n-dimensional Lebesgue measure if and only if its image o( E) is 
measurable with respect to the product measure bm X by, on R™ X R*, and 


Mn(E) = (Mm X we) (9(E)). 


Proof Define 7, to be the collection of intervals in R” and vol, the set function volume 
defined on Z,,. Since vol, is a o-finite premeasure, it follows from the uniqueness part of 
the Carathéodory -Hahn Theorem that Lebesgue measure jz, is the unique measure on CL” 
which extends vol, : Z, — [0, oo]. It is clear that 


uin(L) = (ttm X e)((L)) for all 1 € Zp. (21) 


We leave it as an exercise for the reader to show that this implies that outer measures are 
preserved by ¢ and therefore E belongs to £" if and only if g( E) is (um X wx )-measurable. 
Since ¢ is one-to-one and onto it follows that if we define 


w!(E) = (im X wx) ((E)) for all E € £", 


then y’ is a measure on £” that extends vol,: Z, — [0, oo]. Therefore, by the above 
uniqueness assertion regarding py, 


n(E) = w'(E) = (Mm X ee) (P(E) for all E € L". 
This completes the proof. | UL 


From this proposition, the completeness and o-finiteness of Lebesgue measure and the 
Theorems of Fubini and Tonelli, we have the following theorem regarding integration with 
respect to Lebesgue measure on LR". 


Theorem 16 For natural numbers n, m, and k such that n = m +k, consider the mapping 
go: R" + R™ XR¥ defined by (20). A function f : R™ XR‘ > R is measurable with respect to 
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the product measure [bm X x if and only if the composition f og: R" > R is measurable with 
respect to Lebesgue measure fy. If f is integrable over R" with respect to Lebesgue measure 


Hn, then 
ft aun =|. I. f(x, y)dpm(x) 


Moreover, if f is nonnegative and measurable with respect to Lebesgue measure 1, the above 
equality also holds. 


dpx(y). (22) 


Lebesgue Integration and Linear Changes of Variables We denote by £(R") the linear 
space of linear operators T: R’ + R". We denote by GL(n, R) the subset of £(R") 
consisting of invertible linear operators T: R" — R", that is, linear operators that are 
one-to-one and onto. The inverse of an invertible operator is linear. Under the operation of 
composition, GL(n, R) is a group called the general linear group of R". For 1 < k <n, we 
denote by e; the point in R” whose kth coordinate is 1 and other coordinates are zero. Then 
{€1,..., &,} 18 the canonical basis for £(R” ). Observe that a linear operator T: R" > R" is 
uniquely determined once T(e; ) is prescribed for 1 < k <n, since if x = (x1,..., x,), then 


T(x) = T(x1e1 +--+ +Xn€n) = x1T(e,) +---+XnT (en) for all x € R”. 
The only analytical property of linear operator that we need is that they are Lipschitz. 
Proposition 17 A linear operator T: R" — R" is Lipschitz. | 
Proof Let x belong to R”. As we have just observed, by the linearity of T, 
T(x) = xyT(e,) +---+x,T(e,) for all x € R". 


Therefore, by the subadditivity and positivity homogeneity of the norm, 


T(x) = aT (er) + an T(en)Il <> bee - IT (ee) 
k=1 


Hence, if we define c = J s [T(e.)||2, by the Cauchy-Schwarz Inequality, 
k=1 


T(x) <c- [el 


For any u, v € R", set x = u —v. Then, by the linearity of T, T(x) = T(u—v) = T(u)—T(v), 
and therefore 
|T(u) — T(v) Il <c- lu — vf. 


O 


We have already observed in our study of Lebesgue measure on the real line that 
a continuous function will not, in general, map Lebesgue measurable sets to Lebesgue 
measurable sets. However, a continuous mapping that is Lipschitz does map Lebesgue 
measurable sets to Lebesgue measurable sets. 
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Proposition 18 Let the mapping V: R" — R" be Lipschitz. If E is a Lebesgue measurable 
subset of R", so is V(E). In particular, a linear operator T: R" —> R” maps Lebesgue 
measurable sets to Lebesgue measurable sets. 


Proof A subset of R” is compact if and only if it is closed and bounded and a continuous 
mapping maps compact sets to compact sets. Since V is Lipschitz, it is continuous. Therefore 
maps bounded F, sets to Fg sets. 


Let E be a Lebesgue measurable subset of R”. Since R” is the union of a countable 
collection of bounded measurable sets, we may assume that E is bounded. According to 
Corollary 14, E = AU D, where A is an Fy subset of R” and D has Lebesgue outer measure 
zero. We just observed that W(A) is an F, set. Therefore to show that V( E) is Lebesgue 
measurable it suffices to show that the set V(D) has Lebesgue outer measure zero. 


Let c > 0 be such that 
'P(u) —V(v)|| <c- lu — vl for allu, ve R”. 


There is a constant c’ (see Problem 24) that depends solely on c and n such that for any 
interval J in R", 
pi(¥(1)) <d -vol(1). (23) 


Let € > 0. Since u*(D) = 0, there is a countable collection {1}, of intervals in R” that 
oO 

cover D and for which ¥ vol(I*) < . Then (‘¥(J*)}®, is a countable cover of ¥(D). 
=1 


Therefore by the estimate (23) and the countable monotonicity of outer measure, 


us(¥(I)) = S wg(H(Ue)) < De -vol( tk) <e'-€ 
= k= 


Since this holds for all € > 0 it also holds for « = 0. L] 


Corollary 19 Let the function f: R" — R be measurable with respect to Lebesgue measure 
and the operator T : R" — R° be linear and invertible. Then the composition f oT: R" > R 
also is measurable with respect to Lebesgue measure. 


Proof Let © be an open subset of R. We must show that (fo T)~!(O) is Lebesgue 
measurable. However, 


(fot) (0) =T!(f-'(O)). 


But the function f is measurable and therefore the set f—'(O) is measurable. On the 
other hand, the mapping 7! is linear and therefore, by the preceding proposition, maps 
Lebesgue measurable sets to Lebesgue measurable sets. Thus (f © T)~!(O) is Lebesgue 
measurable. O 


We will establish a general formula for the change in the value of a Lebesgue integral 
over R" under a linear change of variables and begin with dimensions n = 1 and 2. 
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Proposition 20 Let f: R > R be integrable over R with respect to one-dimensional Lebesgue 
measure. If a, B € R, a #0, then 


] 
[ fd = [ f(ax) dy (x) and [ fam = | F(x+B)dui(x). (24) 


Proof By the linearity of integration we may assume f is nonnegative. Approximate the 
function f an increasing sequence of simple integrable functions and thereby use the 
Monotone Convergence Theorem to reduce the proof to the case that f is the characteristic 
function of a set of finite Lebesgue measure. For such a function the formulas are evident. 


Proposition 21 Let f: R? > R be integrable over R* with respect to Lebesgue measure p12 
and c #0 be a real number. Define y: R* > R, Ww: R? > Randy: R2>R by 


v(x, y) = fly, x), W(x, y) = f(x, x+y) and (x, y) = f(ex, y) forall (x, y) € R*. 


Then ¢, , and 7 are integrable over R2 with respect to Lebesgue measure 9. Moreover, 
n § P & Mb 


i] dyn = | edun = | wduz, 
R? R? R? 


1 
[ fam=—- [ ndpo. 
R? Ic| JR 


Proof We infer from Corollary 19 that each of the functions y, Ww, and 7 is w2-measurable. 
Since integration is linear, we may assume that f is nonnegative. We compare the integral 
of f with that of » and leave the other two as exercises. Since f 1s 2-measurable, we infer 
from Fubini’s Theorem, as expressed in Theorem 16, that 


| fdu2= | | f(x, a) dui(y). 

R2 R|/R 

However, by the definition of the function y, for almost all y € R, 
f fs »)du(2) = fo. 2)dua(n 


and therefore 


[ | [ F(x, y) dus (x) 


Since ¢ is nonnegative and y2-measurable we infer from Tonelli’s Theorem, as expressed in 
Theorem 16, that 
I 


duy(y) = [ | [ oly, a) dy(y). 


[ e, s)dits) aun(y) = [ pdm. 
JR R? 


Therefore 


du =| gdp. 
Jf ; R? ° a 
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So far the sole analytical property of linear mappings that we used 1s that such mappings 
are Lipschitz. We now need two results from linear algebra. The first is that every operator 
T € GL(n, R) may be expressed as the composition of linear operators of the following 
three elementary types: 


Type 1: T(e;) =c-e; and T(ex) = eg, fork#j: 
Type 2: T(e;) = ej41, T(ej41) = ej and T(e,) = ex fork#j,j+1: 
Type 3: T(e;) = e; + ej41, and T(e,) = e fork #j: 


That every invertible linear operator may be expressed as the composition of elemen- 
tary operators is an assertion in terms of linear operator of a property of matrices: every 
invertible n Xn matrix may be reduced by row operations to the identity matrix. 

The second property of linear operators that we need is the following: To each linear 
operator T: R” — R" there is associated a real number called the determinant of T and 
denoted by det7, which possesses the following three properties: 


(i) For any two linear operators T, S: R" > R" 
det(S oT) = detS - detT, (25) 


(ul) detT =c if T is of Type 1, detT = —1 if T is of Type 2, and detT = 1 if T is of Type 3. 
(iii) If T(e, ) = 1 and T maps the subspace {x € R” |x = (x1, x2,..., Xn—1, 0)} into itself, 
then detT = det7’, where 7’: R’~! > R"7! is the restriction of T to R’~!. 


Theorem 22 Let the linear operator T: R" — R" be invertible and the function f: R" > R 
be integrable over R" with respect to Lebesgue measure. Then the composition f oT: R” > R 
also is integrable over R” with respect to Lebesgue measure and 


[ foT dun = f dptn. (26) 


1 
\detT| R" 


Proof Integration is linear. We therefore suppose f is nonnegative. In view of the mul- 
tiplicative property of the determinant and the decomposability of an invertible lmear 
operator into the composition of elementary operators, we may also suppose that T is 
elementary. The case n = 1 is covered by (24). By Proposition 21, (26) holds if n = 2. We 
now apply an induction argument. Assume we have proven (26) for m > 2 and consider 
the case n = m+ 1. Since T is elementary and n > 3, , either (1) T(e, ) = e, and T maps 
the subspace {x € R" |x = (x1,..., X,_1, 0)} into itself or (ii) T(e; ) = e, and T maps the 
subspace {x € R” |x = (0, x2,..., x, )} into itself. We consider case (1) and leave the similar 
consideration of case (ii) as an exercise. Let 7’ be the operator induced on R"~! by T. 
Observe that |detT’| = |detT|. We now again argue as we did in the proof of Proposition 
21. The function f o T is »-measurable. Therefore we infer from Fubini’s Theorem and 
Tonelli’s Theorem, as formulated for Lebesgue measure in Theorem 16, together with the 
validity of (26) for m =n — 1, that 
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rid Re dpi (Xn) 
J Re 


| f° Pae -| | f oT (x1, x2,...,%n) don—1(41,.--5 Xn-1) 
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Corollary 23 Let the linear operator T: R" — R" be invertible. Then for each Lebesgue 
measurable subset E of R", T(E) is Lebesgue measurable and 


Mn(T(E)) = |det(T)| - un( E). (27) 


Proof We assume that E is bounded and leave the unbounded case as an exercise. Since T 
is Lipschitz, T(E) is bounded. We infer from Proposition 18 that the set T(E) is Lebesgue 
measurable and it has finite Lebesgue measure since it is bounded. Therefore the function 
f = X7(£) 1s integrable over R” with respect to Lebesgue measure. Observe that foT = yz. 
Therefore 


f 0 T din = pn(E) and | f din = bn(T(E)). 
R" R" 
Hence (27) follows from (26) for this particular choice of f. a 


By a rigid motion of R” we mean a mapping V of R” onto R" that preserves Euclidean 
distances between points, that is, 


Cu) — ¥(v)|| = lu — v|| for allu, v eR”. 


A theorem of Mazur and Ulam: tells us that every rigid motion is a constant perturbation 
of a linear rigid motion, that is, there is a point xp in R" and T: R” — R" linear such that 
W(x) = T(x) + x0 for all x € R", where T is a rigid motion. However, since a linear rigid 
motion maps the origin to the origin, T preserves the norm, that is, 


7(w)|| = |lul| for all u € R”. 


3See pages 49-51 of Peter Lax’s Functional Analysis [Lax02]. 
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Thus the following polarization identity (see Problem 28), 
1 
(u,v) = Flu + vl? — lu — v7} for all u, v € R®, (28) 
tells us that a linear rigid motion T preserves the inner product, that is, 
(T(u), T(v)) = (u, v) for allu, ve R". 


This identity means that T*T = Id. From the multiplicative property of the determinant 
and the fact that det? = detT*, we conclude that |det7| = 1. Therefore by the translation 
invariance of Lebesgue measure (see Problem 20) and (27) we have the following interesting 
geometric result: If a mapping on Euclidean space preserves distances between points, then 
it preserves Lebesgue measure. 


Corollary 24 Let V: R" > R" be a rigid motion. Then for each Lebesgue measurable subset 
E of R’, 
Hn(V(E)) = un( E). 


It follows from the definition of Lebesgue outer measure u* that the subspace 
V = {x € R"|x = (x1, x2,...,Xn-1, 0)} of R” has n-dimensional Lebesgue measure zero 
(see Problem 30). We may therefore infer from (27) that any proper subspace W of R” has 
n-dimensional Lebesgue measure zero since it may be mapped by an operator in GL(n, R) 
onto a subspace of V. It follows that if a linear operator T: R” > R” fails to be invertible, 
then, since its range lies in a subspace of dimension less than n, it maps every subset of R” to 
a set of n-dimensional Lebesgue measure zero. This may be restated by asserting that (27) 
continues to hold for linear operators T that fail be invertible. 


PROBLEMS 


15. Consider the triangle A = {(x, y) € R?|0 < x < a,0 < y < [b/a]x}. By covering A with 
finite collections of rectangles and using the continuity of measure, determine the Lebesgue 
measure of A. 


16. Let [a, b] be a closed, bounded interval of real numbers. Suppose that f: [a, b] > R is 
bounded and Lebesgue measurable. Show that the graph of f has measure zero with respect 
to Lebesgue measure on the plane. Generalize this to bounded real-valued functions of 
several real variables. 


17. Every openset of real numbers is the union of a countable disjoint collection of open intervals. 
Is the open subset of the plane {(x, y) € R?|0 <x, y < 1} the union of a countable disjoint 
collection of open balls? 


18. Verify inequality (13). 
19. Verify the set equality (16). 
20. Let ECR” andz eR". 
(i) Show that E + z is open if E is open. 
(ii) Show that FE + zis Gg if E is Gs. 
(iii) Show that w*(E +z) = px(E). 
(iv) Show that E is 4,-measurable if and only if E + z is u,»-measurable. 


21. For each natural number n, show that every subset of R” of positive outer Lebesgue measure 
contains a subset that is not Lebesgue measurable. 
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22. 
23. 
24. 
25. 
26. 


27. 


28. 
. Let the mapping T: R” > R" be linear. Define c = sup {|| T(x) || | ||x|| < 1}. Show that c is the 


31. 


32. 


33. 


For each natural number n, show that there is a subset of R” that is not a Borel set but is 
/4n-measurable. 


If (27) holds for each interval in R”, use the uniqueness assertion of the Carathéodory -Hahn 
Theorem to show directly that it also holds for every measurable subset of R". 


Let ¥: R” — R" be Lipschitz with Lipschitz constant c. Show that there is a constant c’ that 
depends only on the dimension n and c for which the estimate (23) holds. 


Prove that the Cartesian product of two semirings is a semiring. Based on this use an induction 
argument to prove that the collection of intervals in R" is a semiring. 


Show that if the function f: [0, 1] x [0, 1] + R is continuous with respect to each variable 
separately, then it is measurable with respect to Lebesgue measure py. 


Let g: R — R be a mapping of R onto R for which there is a constant c > 0 for which 
lg(u) — g(v)| >c-|u—v| forallu,veR. 
Show that if f: R — R is Lebesgue measurable, then so is the composition f og: R—> R. 


By using the bilinearity of the inner product, prove (28). 


smallest Lipschitz constant for T. 


- Show that a subspace of W of R” of dimension less than n has n-dimensional Lebesgue 


measure Zero by first showing this is so for the subspace {x € R” | x, = 0}. 


Prove the two change of variables formulas (24) first for characteristic functions of sets of 
finite measure, then for simple functions that vanish outside a set of finite measure and finally 
for nonnegative integrable functions of a single real variable. 


For a subset E of R, define 
o(E) ={(x, y)eR?| x-ye Eh. 


(i) If E is a Lebesgue measurable subset of R, show that o(E) is a measurable subset of 
R2. (Hint: Consider first the cases when E open, E a Gs, E of measure zero, and E 
measurable.) 


(ii) If f is a Lebesgue measurable function on R, show that the function F defined by 
F(x, y) = f(x — y) isa Lebesgue measurable function on R?. 


(ui) If f and g belong to L'(R, 41 ), Show that for almost all x in R, the function given by 
9(y) = f(x—y)g(y) belongs to L1(R, p11). If we denote its integral by h(x), show that 


h is integrable and 
[mau = f isidun-f teldn 
R R R 
Let f and g be functions in L'(R, y), and define f * gon R by 


(f*e)(y)= [ fly — x)g(x)duy(x). 


(i) Show that f*xg= gf. 
(ii) Show that (f *g)*h = f *(g*h) foreachh € L'(R, p,). 
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34. Let f be a nonnegative function that is integrable over R with respect to y;. Show that 
po{(x, y) €R’| O< ys f(x)b=mo{(x, y) eR? | o<y<f(x)}= f f(x) ax 


For each t > 0, define g(t) = wy{x € R| f(x) = t}. Show that ¢ is a decreasing function and 


[dui = | f(x) duy(x). 
0 R 


20.3 CUMULATIVE DISTRIBUTION FUNCTIONS AND BOREL MEASURES ON R 


Let I = [a, b] be a closed, bounded interval of real numbers and B(/) the collection of Borel 
subsets of J. We call a finite measure uw on B(/) a Borel measure. For such a measure, define 
the function g,: I > R by 

8u(x) = pla, x] for all x in J. 


The function g, is called the cumulative distribution function of jw. 


Proposition 25 Let uz be a Borel measure on B(1). Then its cumulative distribution function 
g,, is increasing and continuous on the right. Conversely, each function g: I — R that is increa- 
sing and continuous on the right is the cumulative distribution function of a unique Borel 
measure pt, on B( 1). 


Proof First let 4 be a Borel measure on B(/). Its cumulative distribution function is certainly 
increasing and bounded. Let xo belong to [a, b) and {x;,} be a decreasing sequence in (xo, b] 
that converges to xo. Then (7°, (x0, xx] = 9 so that, since y is finite, by the continuity of 
measure, 


O= (9) = lim w(xo, xc] = lim [gu (xe) — 8u(20)]- 
> OO k— 00 
Thus g,, is continuous on the right at xo. 


To prove the converse, let g: J + R be an increasing function that is continuous on 
the right. Consider the collection S of subsets of J consisting of the empty set, the singleton 
set {a}, and all subintervals of J of the form (c, d]. Then S is a semiring. Consider the set 
function w: S > R defined by setting 4(@) = 0, u{a} = g(a) and 


w(c, d] = g(d) — g(c) for (c, d] CI. 
We leave it as an exercise (see Problem 39) to verify that if (c, d] C J is the union of finite 


n 
disjoint collection LU (cx, dy], then 
k=1 


(4) - g(c) = Slade) ~ 9(ce)] 
k= 


oO 
and that if (c, d] C J is covered by the countable collection U (cx, dj], then 
k=1 


e(d) - g(c) < > [e(dx) — e( ce) (29) 
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This means that yw is a premeasure on the semiring S. By the Carathéodory -Hahn Theorem, 
the Carathéodory measure j induced by p is an extension of yw. In particular, each open 
subset of [a, b] is u*-measurable. By the minimality of the Borel o-algebra, the a-algebra of 
* measurable sets contains B(]). The function g is the cumulative distribution function for 
the restriction of @ to B(/) since for each x € [a, 5], 


pla, x] = w{a} + w(a, x] = g(a) +[g(x) — g(a)] = g(x). O 


It is natural to relate the continuity properties of a Borel measure to those of its 
cumulative distribution function. We have the following very satisfactory relation whose 
proof we leave as an exercise. 


Proposition 26 Let u be a Borel measure on B(1) and g,, its cumulative distribution function. 
Then the measure y is absolutely continuous with respect to Lebesgue measure if and only if 
the function g,, is absolutely continuous. 


For a bounded Lebesgue measurable function f on [a, b], the Lebesgue integral 
Sia. 6] f dm is defined, where m denotes Lebesgue measure. For a bounded function f on 
[a, b] whose set of discontinuities has Lebesgue measure zero, we proved that the Riemann 
integral f° f (x) dx is defined and 


J, fam = [ f(x) dx. 


There are two generalizations of these integrals, the Lebesgue-Stieltjes and Riemann-Stieltjes 
integrals, which we now briefly consider. 

Let the function g: J > R be increasing and continuous on the right. For a bounded 
Borel measurable function f: I + R, we define the Lebesgue-Stieltjes integral of f with 


respect to g over [a, b], which we denote by f° f dg, by 


I, (fae I, fie (30) 


Now suppose that f is a bounded Borel measurable function and g is increasing and 
absolutely continuous. Then g’ Lebesgue integrable function over [a, b] and hence so is f - g’. 
We have 


| tde-|[  fe’dm, 31) 
[a, b] [a, b] 


where the right-hand integral is the integral of f - 2’ with respect to Lebesgue measure m. 
To verify this formula, observe that it holds for f a Borel simple function and then, by the 
Simple Approximation Theorem and the Lebesgue Dominated Convergence Theorem, it 
also holds for a bounded Borel measurable function f. In this case, by Proposition 26, jr, is 
absolutely continuous with respect to m. We leave it to the reader to verify that function ¢’ 
is the Radon-Nikodym derivative of 4, with respect to m (see Problem 44). 

There is a Riemann-Stieltjes integral that generalizes the Riemann integral in the same 
manner that the Lebesgue-Stieltjes integral generalizes the Lebesgue integral. We briefly 
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describe this extension.’ If P = (xo, x1,...,Xn} isa partition of [a, b], we let || P|] denote the 
maximum of the lengths on the intervals determined by P and let C = {c1,...,c,}, where 
each c; belongs to [x;_1, x;]. For two bounded functions f: [a, b] > R and g: [a, b] > R, 
consider sums of the form 


S(f, @, P,C)= > Fe) [e(xi) — e(xi-1) 
k= 


If there is a real number A such that for each « > 0, there is a 6 > 0 such that 
if || P|| <6, then |S(f, g, P, C)- A] <e«, 


then f is said to be Riemann-Stieltjes integrable over J with respect to g and we set 


b 
A= | s(x)de(x), 


It is clear that if g(x) = x for all x € [a, b], then the Riemann-Stieltjes integral of f 
with respect to g is just the Riemann integral of f. Moreover, if f is continuous and g 
is monotone, then f is Riemann-Stieltjes integrable over I with respect to g.” However, 
a theorem of Camille Jordan tells us a function of bounded variation is the difference of 
increasing functions. Therefore a continuous function on J is Riemann-Stieltjes integrable 
over J with respect to a function of bounded variation. The Lebesgue-Stieltjes integral and 
the Riemann-Stieltjes integrals are defined for different classes of functions. However, they 
are both defined if f is continuous and g is increasing and absolutely continuous. In this case, 


they are equal, that 1s, 
b 
[ seran= rae 


since (see Problems 36 and 37) each of these integrals is equal to Sia 6] fg’ dm, the Lebesgue 
integral of f - g’ over [a, b] with respect to Lebesgue measure m. 


PROBLEMS 
35. Prove Proposition 26. 


36. Suppose f is a bounded Borel measurable function on [a, b] and g is increasing and absolutely 
continuous on fa, b]. Prove that if m denotes Lebesgue measure, then 


[ fag= fe'dm. 
[a, b] [a, b] 


37. Suppose f is a continuous function on [a, b] and g is increasing and absolutely continuous on 
[a, b|. Prove that if m denotes Lebesgue measure, then 


[ nevacco= | team 


4On pages 23-31 of Richard Wheedon and Antoni Zygmund’s Measure and Integral [WZ77] there is a precise 
exposition of the Riemann-Stieltjes integral. 


>The proof of this is a slight variation of the proof of the Riemann integrability of a continuous function. 
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38. 


39. 


41. 


42. 


43. 


45. 


Let f and g be functions on [—1, 1] such that f = 0 on [—1, 0], f = 1 and (0, 1], and g =0 
on [—1, 0), g = 1 and [0, 1]. Show that f is not Riemann-Stieltjes integrable with respect to 
g over [—1, 1] but is Riemann-Stieltjes integrable with respect to g on [—1, 0] and on [0, 1]. 


Prove the inequality (29). (Hint: Choose ¢ > 0. By the continuity on the right of g, choose 
ni > 0 so that g(b; + n;) < g(bj) + €2~', and choose 6 > 0 so that g(a + 6) < g(a) + €. Then 
the open intervals (a;, bj + i) cover the closed interval [a + 6, b].) 


. For an increasing function g: [a, b] > R, define 


g'(x) = lim. g(y). 


Show that g* is an increasing function that is continuous on the right and agrees with g 
wherever g is continuous on the right. Conclude that g = g*, except possibly at a countable 
number of points. Show that (g*)* = g*, and if g and G are increasing functions that agree 
wherever they are both continuous, then g* = G*. If f is a bounded Borel measurable 


function on [a, b], show that 
| fdg= | f dg". 
[a, b] [a, b] 


(i) Show that each bounded function g of bounded variation gives rise to a finite signed 
Borel measure v such that 


v(c, d] = g(d*) — g(c*) forall (c, d] C [a, D]. 


(ii) Extend the definition of the Lebesgue-Stieltjes integral Sia b] f dg to functions g of 
bounded variation and bounded Borel measurable functions f. 


(ii) Show that if | f| < M on [a, b] and if the total variation of g is T, then | Sia b] f dg| < MT. 


Let g be a continuous increasing function on [a, b] with g(a) =c, g(b) =d, and let f bea 
nonnegative Borel measurable function on [c, d]. Show that 


fogdg= f dm. 
[a, b] [c, d] 


Let g be increasing on [a, b]. Find a Borel measure yz on B([a, b]) such that 


b 
| f(x) dg(x) =| f dp for all f € C[a, b]. 
a [a, | 


. If the Borel measure yp is absolutely continuous with respect to Lebesgue measure, show that 


its Radon-Nikodym derivative is the derivative of its cumulative distribution function. 


For a finite measure ys on the collection B(R) of all Borel subsets of R, define g: R-> R 
by setting g(x) = y4(—00, x]. Show that each bounded, increasing function g: R — R that is 
continuous on the right and lim,-,-.o g(x) = 0 is the cumulative distribution function of a 
unique finite Borel measure on B(R). 
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20.4 CARATHEODORY OUTER MEASURES AND HAUSDORFF 
MEASURES ON A METRIC SPACE 


Lebesgue outer measure on Euclidean space R” has the property that if A and B are subsets 
of R” and there is a 6 > 0 for which ||u — v|| > 6 for allu ¢ A and ve B, then 


H(A U B) = w,(A) + wr, (B). 


We devote this short section to the study of measures induced by outer measures on a metric 
space that possess this property and a particular class of such measures called Hausdorff 
measures. 

Let X be a set and I a collection of real-valued functions on X. It is often of interest 
to know conditions under which an outer measure y* has the property that every function 
in I’ is measurable with respect to the measure induced by yx* through the Carathéodory 
construction. We present a sufficient criterion for this. Two subsets A and B of X are said 
to be separated by the real-valued function f on X provided there are real numbers a and b 
with a < b for which f < aon A and f > bonB. 


Proposition 27 Let ¢ be a real-valued function on a set X and y*: 2* — [0, oo] an outer 
measure with the property that whenever two subsets A and B of X are separated by ¢, then 


w(AU B) = p*(A) + p*(B) 
Then ¢ is measurable with respect to the measure induced by p*. 
Proof Let a be areal number. We must show that the set 
E={xeX| g(x)>a} 
is z*-measurable, that is, that for any € > 0 and any subset A of X of finite outer measure, 
u*(A)+e>p(ANE)+pu*(AN ES). (32) 
Define B= AN E and C = AN E°. For each natural number n, define 
B, = {xéB| g(x) >a+1/n} and R, = B,~By_1. 
We have 
OO 
UR 
k=n+1 
Now on B,_2 we have g > a+1/(n —2), while on R, we have g < a+1/(n—1). Thus @ 


B= B,U 


k~1 
separates R, and B,_2 and hence separates Ry, and (J R>;, since the latter set is contained 
i=] 


J= 
in Bo,_2. Consequently, we argue by induction to show that for each k, 


k-1 k 


k 
pe U %j = p*( Rx) + U &j = 2, #'(Roj): 
I= J= JF 
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Since >“ jai R2j G BC A, we have Lie _, M*(Roj) < w*(A), and so the series ye ( R2;) 
converges. Similarly, the series °°, 4*(Roj41) converges, and therefore so does the series 


Die H*( Rx). Choose n so large that Drun+1 M* (Ri) <¢. Then by the countable monotonicity 
of p*, 


oO 
w*(B) <m*(Br)+ DS) w*( Re) <m*(Bn) + 

k=n+1 

or 
u"(By) > w*(B) ~«. 
Now 
w(A) > w*( Br UC) = w*(Bn) + w*(C) 

since g separates B, and C. Consequently, 


pr(A) > w(B) + w*(C) — 
We have established the desired inequality (32). LJ 


Let (X, p) be a metric space. Recall that for two subsets A and B of X, we define the 
distance between A and B, which we denote by p(A, B), by 
p(A, B) = inf plu, v). 


By the Borel o-algebra associated with this metric space, which we denote by B(X), we 
mean the smallest o-algebra containing the topology induced by the metric. 


Definition Let (X, p) be a metric space. An outer measure p*: 2* —> [0, 00] is called 
a Carathéodory outer measure provided whenever A and B are subsets of X for which 
p(A, B) > 0, then 

w(AU B) = p*(A) + p*(B). 


Theorem 28 Let u* be a Carathéodory outer measure on a metric space (X, p). Then every 
Borel subset of X is measurable with respect to y*. 


Proof The collection of Borel sets is the smallest o-algebra containing the closed sets, and 
the measurable sets are a o-algebra. Therefore it suffices to show that each closed set is 
measurable. However, each closed subset F of X can be expressed as F = f—'(0) where f 
is the continuous function on X defined by f(x) = p(F, {x}). It therefore suffices to show 
that every continuous function is measurable. To do so, we apply Proposition 27. Indeed, let 
A and B be subsets of X for which there is a continuous function on X and real numbers 
a <b such that f < aon A and f > b on B. By the continuity of f, p(A, B) > 0. Hence, 
by assumption, u*(AU B) = w*(A) + u*(B). According to Proposition 27, each continuous 
function is measurable. The proof is complete. LJ 


We now turn our attention to a particular family of Carathéodory outer measures on the 
metric space (X, p). First recall that we define the diameter of a subset A of X, diam(A), by 


diam(A) = sup p(u, v). 


u,veA 
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Fix a > 0. For each positive real number a, we define a measure H, on the Borel o-algebra 
B(X) called the Hausdorff measure on X of dimension a. These measures are particularly 
important for the Euclidean spaces R", in which case they provide a gradation of size among 
sets of n-dimensional Lebesgue measure zero. 

Fix a > 0. Take € > 0 and for a subset E of X, define 


H‘)(E) = inf 5 [diam(A,)}*, 
k=] 


where {A;}72, 1s a countable collection of subsets of X that covers E and each A; has a 
diameter less than e. Observe that Hi? increases as € decreases. Define 


H,(E) =sup H‘«)( E) = lim H‘)(E). 


e>0 e—>0 


Proposition 29 Let (X, p) be a metric space and a a positive real number. Then H*: 2* + 
[0, co] is a Carathéodory outer measure. 


Proof It is readily verified that H* is a countably monotone set function on 2* and 
H;(@) = 0. Therefore H% is an outer measure on 2*. We claim it is a Carathéodory outer 
measure. Indeed, let E and F be two subsets of X for which p(E, F) > 5. Then 


H\O) (EU F) > H6O(E) + HO(F) 


as soon as € < 6: For if {Ax} is a countable collection of sets, each of diameter at most €, that 
covers FU F, no Ax can have nonempty intersection with both E and F. Taking limits as 
€ — 0, we have 

H*(EU F) > H#(E)+ H*(F). O 


We infer from Theorem 28 that H% induces a measure on a o-algebra that contains 
the Borel subsets of X. We denote the restriction of this measure to B( X) by Hg and call it 
Hausdorff a-dimensional measure on the metric space X. 


Proposition 30 Let (X, p) be a metric space, A a Borel subset of X, and a, B positive real 
numbers for which a < B. If Ha{ A) < 00, then Hg(A) = 0, 


Proof Let «>0. Choose {A;}?2, to be a covering of A by sets of diameter less than ¢ for which 
oO 
S) [diam(Ax)]* < Ha(A) +1. 
k=1 


Then 


Hi (A) < Y laiam( a PP} < &. Y laiam( a yl" <A *.[A(A) +1]. 


Take the limit as « + 0 to conclude that Hg( A) = 0. L 
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For a subset E of R”, we define the Hausdorff dimension of E, dimy4(£), by 
dimyy(£) = inf {8 >0| Hg(E)=0}. 


Hausdorff measures are particularly significant for Euclidean space R”. In the case 
n = 1, H, equals Lebesgue measure. To see this, let J C R be an interval. Given € > 0, 
the interval J may be expressed as the disjoint union of subintervals of length less that € 
and the diameter of each subinterval is its length. Thus H; and Lebesgue measure agree on 
the semiring of intervals of real numbers. Therefore, by the construction of these measures 


from outer measures, these measures also agree on the Borel sets. Thus H\)(E) is the 
Lebesgue outer measure of E. Forn > 1, H,, is not equal to Lebesgue measure (see Problem 
48) but it can be shown that it is a constant multiple of n-dimesional Lebesgue measure 
(see Problem 55). It follows from the above proposition that if A is a subset of R” that 
has positive Lebesgue measure, then dimy(A) =n. There are many specific calculations of 
Hausdorff dimension of subsets of Euclidean space. For instance, it can be shown that the 
Hausdorff dimension of the Cantor set is log 2/ log 3. Further results on Hausdorff measure, 
including specific calculations of Hausdorff dimensions, may be found in Yakov Pesin’s book 
Dimension Theory and Dynamical Systems [Pes98]. 


PROBLEMS 


46. Show that in the definition of Hausdorff measure one can take the coverings to be by open 
sets or by closed sets. 


47. Show that the set function outer Hausdorff measure H* is countably monotone. 


48. In the plane R’ show that a bounded set may be enclosed in a ball of the same diameter. Use 
this to show that for a bounded subset A of R, H)(A) > 4/7 - 2(A), where pz is Lebesgue 
measure on R?. 


49. Let (X, p) be a metric space and a > 0. For E € X, define 


H,(E) = inf S [diam(Ax)]*, 
k=l 


where {A;}7°., is a countable collection of subsets of X that covers E: there is no restriction 
regarding the size of the diameters of the sets in the cover. Compare the set functions H/, and 
Me 
50. Show that each Hausdorff measure H, on Euclidean space R” is invariant with respect to 
rigid motions. 
51. Give a direct proof to show that if J is a nontrivial interval in R", then H, (I) > 0. 
52. Show that in any metric space, Ho is counting measure. 


53. Let [a, b] be a closed, bounded interval of real numbers and R = {(x, y) € R’|a < x < 
b, y = 0.} Show that H>( R) = 0. Then show that Hj(R) = b—a. Conclude that the Hausdorff 
dimension of R is 1. 
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54. Let f: [a, b] + R be a continuous bounded function on the closed, bounded interval [a, 5] 
that has a continuous bounded derivative on the open interval (a, b). Consider the graph G 


of f as a subset of the plane. Show that H,(G) = f° 1+[f'(x)|? dx. 


55. Let J be an interval in R”, each of whose sides has length 1. Define y, = H,(J). Show 
that if J is any bounded interval in R”, then H,,(1) = yp - un(/). From this infer, using the 
uniqueness assertion of the Carathéodory-Hahn Theorem, that H, = yy - My, on the Borel 
subsets of R”. 
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In the study of Lebesgue measure, u,, and Lebesgue integration on the Euclidean spaces 
R” and, in particular, on the real line, we explored connections between Lebesgue measure 
and the Euclidean topology and between the measurable functions and continuous ones. 
The Borel o-algebra B(R”) is contained in the o-algebra of Lebesgue measurable sets. 
Therefore, if we define C,(R” ) to be the linear space of continuous real-valued functions on 
R" that vanish outside a closed, bounded set, the operator 


fof f dpy for all f € C.(R") 
Rt 


is properly defined, positive,! and linear. Moreover, for K a closed, bounded subset of R", 
the operator 


pf f dp, for all f € C(K) 
K 


is properly defined, positive, and is a bounded linear operator if C( K ) has the maximum norm. 

In this chapter we consider a general locally compact topological space (X, 7), the 
Borel o-algebra B(X) comprising the smallest c-algebra containing the topology 7, and 
integration with respect to a Borel measure w: B(X) — [0, co). The chapter has two 
centerpieces. The first is the Riesz-Markov Theorem, which tells us that if C.(X) denotes 
the linear space of continuous real-valued functions on X that vanish outside a compact 
set, then every positive linear function on C,(X) is given by integration against a Borel 
measure on B( X ). The Riesz-Markov Theorem enables us to prove the Riesz Representation 
Theorem, which tells us that, for X a compact Hausdorff topological space, every bounded 
linear functional on the linear space C(X), normed with the maximum norm, is given by 


1 linear functional L ona space of real-valued functions on a set X is called positive, provided L( f) > 0 
if f > 0 on X. But, for a linear functional, positivity means L(h) > L(g) if h > g on X. So in our view our 
perpetual dependence on the monotonicity property of integration, the adjective ‘“monotone” is certainly better 
than “positive.” However, we will respect convention and use of the adjective ‘‘positive.”’ 
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integration against a signed Borel measure. Furthermore, in each of these representations 
it is possible to choose the representing measure to belong to a class of Borel measures 
that we here name Radon, within which the representing measures are unique. The Riesz 
Representation Theorem provides the opportunity for the application of Alaoglu’s Theorem 
and Helley’s Theorem to collections of measures. 

The proofs of these two representation theorems require an examination of the 
relationship between the topology on a set and the measures on the Borel sets associated 
with the topology. The technique by which we construct Borel measures that represent 
functionals is the same one we used to construct Lebesgue measure on Euclidean space: 
We study the Carathéodory extension of premeasures defined on particular collection S 
of subsets of X, now taking S = 7, the topology on X. We begin the chapter with a 
preliminary section on locally compact topological spaces. In the second section we gather 
all the properties of such spaces that we need into a single theorem and provide a separate 
very simple proof of this theorem for X a locally compact metric space. 


21.1 LOCALLY COMPACT TOPOLOGICAL SPACES 


A topological space X is called locally compact provided each point in X has a neighborhood 
that has compact closure. Every compact space is locally compact, while the Euclidean spaces 
R” are examples of spaces that are locally compact but not compact. Riesz’s Theorem tells us 
that an infinite dimensional normed linear space, with the topology induced by the norm, is 
not locally compact. In this section we establish properties of locally compact spaces, which 
will be the basis of our subsequent study of measure and topology. 


Variations on Urysohn’s Lemma Recall that we extended the meaning of the word neigh- 
borhood and for a subset K of a topological space X call an open set that contains K a 
neighborhood of K. 


Lemma 1 Let x be a point in a locally compact Hausdorff space X and O a neighborhood of 
x. Then there is a neighborhood Y of x that has compact closure contained in O, that is, 


x €VCVCOand V is compact. 


Proof Let U/ be a neighborhood of x that has compact closure. Then the topological space 
U is compact and Hausdorff and therefore is normal. The set ON U is a neighborhood, with 
respect to the U/ topology, of x. Therefore, by the normality of U/, there is a neighborhood V 
of x that has compact closure contained in ONU: Here both neighborhood and closure mean 
with respect to the / topology. Since O and U are open in X, it follows from the definition of 
the subspace topology that V is open in X and Y C O where the closure now is with respect 
to the X topology. LJ 


Proposition 2. Let K be a compact subset of a locally compact Hausdorff space X and O a 
neighborhood of K. Then there is a neighborhood V of K that has compact closure contained 
in O, that is, 

KCVCVCOand V is compact. 


2There is no loss in understanding the interplay between topologies and measure if the reader, at first reading, 
just considers the case of metric spaces and skips Section 1. 
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Proof By the preceding lemma, each point x € K has a neighborhood NV, that has compact 
closure contained in O. Then {N,},cx is an open cover of the compact set K. Choose a finite 
subcover {NV x}ja1 Of K. The set V = U2, N,, is a neighborhood of K and 


The set \U?_, N x, , being the union of a finite collection of compact sets, is compact and hence 
So is V since it is a closed subset of a compact space. LJ 


For a real-valued function f on a topological space X, the support of f, which we 
denote by supp f, is defined > to be the closure of the set {x € X| f (x) #0}, that is, 


supp f = {xe X| f(x) #0}. 


We denote the collection of continuous functions f: X + R that have compact support by 
C.(X). Thus a function belongs to C,(X) if and only if it is continuous and vanishes outside 
of a compact set. 


Proposition 3 Let K be a compact subset of a locally compact Hausdorff space X and O a 
neighborhood of K. Then there is a function f belonging to C.(X) for which 


f =lonkK, f =0onX~Oand0 < f <1lonX. (1) 


Proof By the preceding proposition, there is a neighborhood V of K that has compact 
closure contained in 0. Since V is compact and Hausdorff, it is normal. Moreover, K and 
V~Vare disjoint closed subsets of V. According to Urysohn’s Lemma, there is a continuous 
real-valued function f on V for which 


f=1lonk, f=OonV~Vand0< f <1lonV. 


Extend f to all of X by setting f = 0 on X~YV. Then f belongs to C,(X) and has the 
properties described in (1). LJ 


Recall that a subset of a topological space is called a Gs set provided it is the intersection 
of a countable number of open sets. 


Corollary 4 Let K be a compact Gs subset of a locally compact Hausdorff space X. Then 
there a function f € C.(X) for which 


K={xeX| f(x) =1}. 


Proof According to Proposition 2, there is a neighborhood U/ of K that has compact closure. 
Since K is a Gs set, there is a countable collection {Ox}-°, of open sets whose intersection 


>This is different from the definition of support in the discussion of measurable sets in which the support of f 
was defined to be the set {x € X | f(x) #0}, not its closure. 
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is K. We may assume ©; C U for all k. By the preceding proposition, for each k there is a 
continuous real-valued function f; on X for which 


fe =1onK, fe =Oon X~O, and0 < fe <1onX. 


The function f defined by 
oO 
f= »> 2* fx on X 
k=1 
has the desired property. LJ 
Partitions of Unity 


Definition Let K be a subset of a topological space X that is covered by the open sets {Ox};_}. 
A collection of continuous real-valued functions on X, {g,}{_,, is called a partition of unity 
for K subordinate to {O;}),_, provided 


suppg; CO;,0 <9; <1lonX forl<i<n 


and 
Qrtgo.t+...t+g, =lonkK. 


Proposition 5 Let K be a compact subset of a locally compact Hausdorff space X and {O;};_, — 
a finite cover of K by open sets. Then there is a partition of unity {p,};_, for K subordinate to 
this finite cover and each gx has compact support. 


Proof We first claim that there is an open cover {U/;,}"_, of K such that for each k, U; is 
a compact subset of O;,. Indeed, invoking Proposition 2 n times, for each x € K, there is a 
neighborhood NV, of x that has compact closure and such that if 1 < j <n and x belongs to 
O,;, then NV, C Oj. The collection of open sets {Nx}xex is a cover of K and K is compact. 
Therefore there is a finite set of points {x;,}/"_, in K for which {N x,}1<k<m also covers K. For 
1 <k <n, let U; be the unions of those N’,,’s that are contained in O;. Then {U1,..., Un} 
is an open cover of K and for each k, U/; is a compact subset of O; since it is the finite union 
of such sets. We infer from Proposition 3 that for each k,1 < k <n, there is a function 
fi € Cce(X) for which f, = 1 0n Ui; and f = 0 on X ~O,. The same proposition tells us that 
there is a function h € C(X) for which h = 1 on K andh = 0 on X~ Uj_, Uk. Define 


f= »> fe on X. 
k=1 
Observe that f+fl - h|>0 on X and h —OonK. Therefore if we define 


tk 


~__Ske___ oon x tort <k< a 
Fala es 


Pk 


{px hi—y isa partition of unity for K subordinate to {O;};_, and each y; has compact support. 
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The Alexandroff One-Point Compactification If X is a locally compact Hausdorff space, 
we can form a new space X* by adjoining to X a single point w not in X and defining a set 
in X* to be open provided it is either an open subset of X or the complement of a compact 
subset in X. Then X* is a compact Hausdorff space, and the identity mapping of X into X* 
is a homeomorphism of X and X* ~ {w}. The space X* is called the Alexandroff one-point 
compactification of X, and w is often referred to as the point at infinity in X*. 

The proof of the the following variant, for locally compact Hausdorff spaces, 
of the Tietze Extension Theorem nicely illustrates the usefulness of the Alexandroff 
compactification. 


Theorem 6 Let K be a compact subset of a locally compact Hausdorff space X and f a 
continuous real-valued function on K. Then f has an extension to a continuous real-valued 
function on all of X. 


Proof The Alexandroff compactification of X, X*, is a compact Hausdorff space. Moreover, 
K is a closed subset of X*, since its complement in X* is open. A compact Hausdorff space 
is normal. Therefore we infer from the Tietze Extension Theorem that f may be extended 
to a continuous real-valued function on all of X*. The restriction to X of this extension is a 
continuous extension of f to all of X. L] 


PROBLEMS 


1. Let X be a locally compact Hausdorff space, and F a set that has closed intersection with 
each compact subset of X. Show that F is closed. 


2. Regarding the proof of Proposition 3: 
(i) Show that F and Y~ V are closed subsets of V. 
(ii) Show that the function f is continuous. 

3. Let X be a locally compact Hausdorff space and X* the Alexandroff one-point compactifica- 
tion of X: 


(i) Prove that the subsets of X* that are either open subsets of X or the complements of 
compact subsets of X are a topology for X*. 


(ii) Show that the identity mapping from X to the subspace X* ~ {w} is a homeomorphism. 
(iit) Show that X* is compact and Hausdorff. 


4. Show that the Alexandroff one-point compactification of R” is homeomorphic to the n-sphere 
S” = {x € R™ | |x|] = 1}. 


5. Show that an open subset of a locally compact Hausdorff space, with its subspace topology, 
is locally compact. 


6. Show that a closed subset of a locally compact space, with its subspace topology, is locally 
compact. 


7. Show that a locally compact Hausdorff space X is compact if and only if the set consisting of 
the point at infinity is an open subset of the Alexandroff one-point compactification X* of X. 


8. Let X be a locally compact Hausdorff space. Show that the Alexandroff one-point compacti- 
fication X* is separable if and only if X is separable. 


14. 


15. 


16. 


17. 


18. 


19. 
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. Consider the topological space X consisting of the set of real numbers with the topology that 


has complements of countable sets as a base. Show that X is not locally compact. 


. Provide a proof of Proposition 3 by applying Urysohn’s Lemma to the Alexandroff one-point 


compactification of X. 


. Let f continuously map the locally compact Hausdorff space X onto the topological space Y. 


Is Y necessarily locally compact? 


. Let X be a topological space and f a continuous function of X that has compact support. 


Define K = {x € X | f(x) = 1}. Show that K is a compact Gs set. 


. Let O be an open subset of a compact Hausdorff space X. Show that the mapping of X to the 


Alexandroff one-point compactification of O that is the identity on O and takes each point in 
X ~ O into w is continuous. 


Let X and Y be locally compact Hausdorff spaces, and f a continuous mapping of X into 
Y. Let X* and Y* be the Alexandroff one-point compactifications of X and Y, and f* the 
mapping of X* into Y* whose restriction to X is f and that takes the point at infinity in X* 
into the point at infinity in Y*. Show that f* is continuous if and only if f—!( K) is compact 
whenever K C Y is compact. A mapping f with this property is said to be proper. 


Let X be a locally compact Hausdorff space. Show that a subset F of X is closed if and only if 
F 1 K 1s closed for each compact subset K of X. Moreover, show that the same equivalence 
holds if instead of being locally compact the space X is first countable. 


Let ¥ be a family of real-valued continuous functions on a locally compact Hausdorff space 
X which has the following properties: 


(i) If f€ F andg ef, then f+geF. 
(ii) If f ¢ F and g € F, then f/g € F, provided that supp f C {x € X | g(x) #0}. 


(iii) Given a neighborhood O of a point xo € X, there is a f € F with f(xo) =1,0< f <1 
and supp f CO. 


Show that Proposition 5 is still true if we require that the functions in the partition of unity 
belong to F. 


Let K be a compact Gg subset of a locally compact Hausdorff space X. Show that there is 
a decreasing sequence of continuous nonnegative real-valued functions on X that converges 
pointwise on X to the characteristic function of K. 


The Baire Category Theorem asserts that in a complete metric space the intersection of a 
countable collection of open dense sets is dense. At the heart of its proof lies the Cantor 
Intersection Theorem. Show that the Fréchet Intersection Theorem is a sufficiently strong 
substitute for the Cantor Intersection Theorem to provide a proof of the following assertion 
by first proving it in the case in which X is compact: Let X be a locally compact Hausdorff 
space. 


(i) If {F,}P°, is a countable collection of closed subsets of X for which each F, has empty 


interior, then the union 7°, F, also has empty interior. 


(ii) If {On}?2, is a countable collection of open dense subsets of X, then the intersection 
Mo, On also is dense. 


Use the preceding problem to prove the following: Let X be a locally compact Hausdorff 
space. If O is an open subset of X that is contained in a countable union UU, F, of closed 
subsets of X, then the union of their interiors, 2, int F,, is an open dense subset of O. 
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20. For a map f: X — Y and acollection C of subsets of Y we define f*C to be the collection of 
subsets of X given by 


fic= {E | E = f—'[C] for some C € cl. 


Show that if A is the c-algebra generated by C, then f*.A is the o-algebra generated by f*C. 


21. Fora map f: X — Y and acollection C of subsets of X, let A be the c-algebra generated by 
C. If f-![f[C]] = C for each C € C, show that f—![f[A]] = A for each A € A. 


21.2 SEPARATING SETS AND EXTENDING FUNCTIONS 


We gather in the statement of the following theorem the three properties of locally compact 
Hausdorff spaces which we will employ in the proofs of our forthcoming representation 
theorems. 


Theorem 7 Let (X, 7) be a Hausdorff space. Then the following four properties are 
equivalent: 


(i) (X, T) is locally compact. 
(ti) If O is a neighborhood of a compact subset K of X, then there is a neighborhood U of 
K that has compact closure contained in O. 
(ui) If O is a neighborhood of a compact subset K of X, then the constant function on K 
that takes the value 1 may be extended to a function f in C,(X) for which 0 < f <1 
on X and f vanishes outside of O. 


(iv) For K a compact subset of X and F a finite open cover of K, there is a partition of 
unity subordinate to F consisting of functions of compact support. 


Proof We first establish the equivalence of (i) and (ii). Assume (ii) holds. Let x be a point in 
X. Then X is a neighborhood of the compact set {x}. By property (ii) there is a neighborhood 
of {x} that has compact closure. Thus X is locally compact. Now assume that X is locally 
compact. Proposition 2 tells us that (ii) holds. 


Next we establish the equivalence of (i) and (iii). Assume (iii) holds. Let x be a point 
in X. Then X is a neighborhood of the compact set {x}. By property (iii) there is a function 
f in C,(X) to take the value 1 at x. Then O = f—'(1/2, 3/2) is a neighborhood of x and it 
has compact closure since f has compact support and O C f~'[1/2, 3/2]. Thus X is locally 
compact. Now assume that X is locally compact. Proposition 3 tells us that (iii) holds. 


Finally, we establish the equivalence of (1) and (iv). Assume property (iv) holds. Let x 
be a point in X. Then X is a neighborhood of the compact set {x}. By property (iv) there is a 
single function f that is a partition of unity subordinate to the covering of the compact set 
{x} by single open set X. Then O = f—'(1/2, 3/2) is a neighborhood of x and it has compact 
closure. Thus X is locally compact. Now assume that X is locally compact. Proposition 5 tells 
us that (iv) holds. _ O 


The substantial implications in the above theorem are that a locally compact Hausdorff 
space possesses properties (ii), (ili), and (iv). Their proofs, which we presented in the 
preceding section, depend on Urysohn’s Lemma. It is interesting to note, however, that if X 
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is a locally compact metric space, then very direct proofs show that X possesses properties 
(ii), (iii), and (iv). Indeed, suppose there is a metric p: X X X — R that induces the topology 
T and X is locally compact. 


Proof of Property (ii) For each x € K, since O is open and X is locally compact, there 
is an open ball B(x, r,,.) of compact closure that is contained in O. Then {B(x, rx/2)}xex is 
a cover of K by open sets. The set K is compact. Therefore there are a finite set of points 
X1,...,%, in K for which {B(x, ry,/2)}i<k<n cover K. Then U = Uj<pen BX, Py,/2) 18 a 
neighborhood of K that, since U CU <p<n B(x, x,/2), has closure contained in O. 


Proof of Property (iii) For asubset A of X, define the function called the distance to 
A and denoted by dist4: X — [0, oo) by 


dist,(x) = inf p(x, y) forx € X. 


The function dist, is continuous; indeed, it is Lipschitz with Lipschitz constant 1 (see Problem 
25). Moreover, if A is closed subset of X, then dist,(x) = Oif and only ifx € A. For Oa 
neighborhood of a compact set K, by part (i) choose U/ to be a neighborhood of K that has 
compact closure contained in O. Define 


disty ~y 


= —————_ on X 
disty ~ 4 + distx 


Then f belongs to C,(X ), takes values in [0, 1], f =10n K and f = 0 on X~O. 


Proof of Property (iv) This follows from properties (ii) and (i) as it did in the case in 
which X is Hausdorff but not necessarily metrizable; see the proof of Proposition 5. 

We see that property (ii) is equivalent to the assertion that two disjoint closed subsets 
of X, one of which is compact, may be separated by disjoint neighborhoods. We therefore 
refer to property (ii) as the locally compact separation property. It is convenient to call (iii) 
the locally compact extension property. 


PROBLEMS 
22. Show that Euclidean space R” is locally compact. 
23. Show that £?, for 1 < p < ov, fails to be locally compact. 
24. Show that C([0, 1]), with the topology induced by the maximum norm, is not locally compact. 
25. Let p: X XX > R bea metric on a set X. For A C X, consider the distance function 


dist, : X — [0, 00). 


(i) Show that the function dist, is continuous. 
(ii) If A C X isclosed and x is a point in X, show that dist 4(x) = 0 if and only if x belongs to A. 


(iii) If A C X is closed and x belongs to X, show that there may not exist a point xo in A for 
which dist4(x) = p(x, xo), but there is such a point xo if K is compact. 
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26. Show that property (ii) in the statement of Theorem 7 is equivalent to the assertion that 
two disjoint closed subsets of X, one of which is compact, may be Separated by disjoint 
neighborhoods. 


21.3 THE CONSTRUCTION OF RADON MEASURES 


Let (X, T) be a topological space. The purpose of this section is to construct measures on 
the Borel o-algebra, B( X ), comprising the smallest o-algebra that contains the topology T. 
A natural place to begin is to consider premeasures yz: T —> [0, 00] defined on the topology 
T and consider the Carathéodory measure induced by y. If we can establish that each open 
set is measurable with respect to y*, then, by the minimality with respect to inclusion of 
the Borel o-algebra among all o-algebras containing the open sets, each Borel set will be 
p*-measurable and the restriction of u* to B(X) will be an extension of x. We ask the 
following question: What properties of 4: T -> [0, oo] are sufficient in order that every 
open set be measurable with respect to 4*, the outer measure induced by zy. It is not useful 
to invoke the Carathéodory -Hahn Theorem here. A topology, in general, is not a semiring. 
Indeed, it is not difficult to see that a Hausdorff topology T is a semiring if and only if T is 
the discrete topology, that is, every subset of X is open (see Problem 27). 


Lemma 8 Let (X, T ) be a topological space, u: T > [0, 00] a premeasure, and 1* the outer 
measure induced by jw. Then for any subset E of X, 


u*(E) = inf {u(U) | Uaneighborhood of E}. (2) 
Furthermore, E is w*-measurable if and only if 
w(O) > w*(ON E) + w*(O~ E) for each open set O for which p(O) < ov. (3) 


Proof Since the union of any collection of open sets is open, (2) follows from the countable 
monotonicity of u. Let E be a subset of X for which (3) holds. To show that E is p*- 
measurable, let A be a subset of X for which u*(A) < oo and let e > 0. We must show that 


p(A) +> wX(ANE) + p*(A~B). (4) 
By the above characterization (2) of outer measure, there is an open set © for which 
ACO and u*(A) +€> y*(0). 6) 
On the other hand, by (3) and the monotonicity of p*, 
w(O) > w(ONE)+p*(O~E) > w*(ANE) +p *(A~E). (6) 
Inequality (4) follows from the inequalities (5) and (6). LC 


Proposition9 Let (X, T) bea topological space and w: T -> [0, 00] a premeasure. Assume 
that for each open set O for which u(O) < ov, 


p(O) = sup (U1) | U openand UC oO}. (7) 


Then every open set is u*-measurable and the measure u*: B( X) — [0, oo] is an extension of . 
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Proof A premeasure is countable monotone and hence, for each open set V, u*(V) = p(V). 
Therefore, by the minimality property of B( X ), to complete the proof it suffices to show that 
each open set is 4*-measurable. 


Let V be open. To verify the x*-measurability of V it suffices, by the preceding lemma, 
to let O be open with 4(O) < oo, let € > 0 and show that 


w(O) +6 > w(ONV) +p (O~YV). (8) 


However, OM V is open and, by the monotonicity of 4, 4(OM V) < oo. By assumption (7) 
there is an open set U for which U CON V and 


u(U) > p(ONYV) —«. 


The pair of sets U and O~ UY are disjoint open subsets of O. Therefore by the monotonicity 
and finite additivity of the premeasure p, 


u(O) = w(UU[O~U]) = w(U) + w(O~U). 
On the other hand, since UC VNO, 
O~V=O0~[ONY]CO~U. 
Hence, by the monotonicity of outer measure, 


w(O~U) > w*(O~Y). 


Therefore _ 
w(O) >= h(U)+ph(O~U) 
> w(ONV) -—e+n(O~l) 
> p(ONV)—e+p*(O~YV). 
We have established (8). The proof is complete. LI 


Definition Let (X, T) bea topological space. We call a measure yt on the Borel o-algebra 
B(X) a Borel measure provided every compact subset of X has finite measure. A Borel 
measure pt is called a Radon measure provided 


(i) (Outer Regularity) for each Borel subset E of X, 
w(E) = inf {u(U) | U aneighborhood of E}; 
(ii) (Inner Regularity) for each open subset O of X, 
u(O) = sup {u(K) | K acompact subset of O}. 


We proved that the restriction to the Borel sets of Lebesgue measure on a Euclidean 
space R” is a Radon measure. A Dirac delta measure on a topological space is a Radon 
measure. 
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While property (7) is sufficient in order for a premeasure w: T — [0, oo] to be 
extended by the measure yux*: B -> [0, oo], in order that this extension be a Radon measure 
it is necessary, if X is locally compact Hausdorff space, that » be what we now name a Radon 
premeasure (see Problem 35). 


Definition Let (X, T) be a topological space. A premeasure 4: T -> [0, ov] is called a 
Radon premeasure* provided 


(i) for each open set U that has compact closure, w(U) < 00; 
(it) for each open set O, 


p(O) = sup e(U ) | U openand U a compact subset of oO}. 


Theorem 10 Let (X, T ) be a locally compact Hausdorff space and 1: T —> [0, 00] a Radon 
premeasure. Then the restriction to the Borel o-algebra B(X) of the Carathéodory outer 
measure * induced by yt is a Radon measure that extends p.. 


Proof A compact subset of the Hausdorff space X is closed, and hence assumption (ii) 
implies property (7). According to Proposition 9, the set function u*: B(X) — [0, oo] is 
a measure that extends yw. Assumption (i) and the locally compact separation property 
possessed by X imply that if K is compact, then u*(K) < oo. Therefore p*: B(X) > [0, oo] 
is a Borel measure. Since pw is a premeasure, Lemma 8 tells us that every subset of X and, 
in particular, every Borel subset of X, is outer regular with respect to u*. It remains only 
to establish the inner regularity of every open set with respect to u*. However, this follows 
from assumption (ii) and the monotonicity of p*. LI 


The natural functions on a topological space are the continuous ones. Of course ev- 
ery continuous function on a topological space X is measurable with respect to the Borel 
o-algebra B(X). For Lebesgue measure on R, we proved Lusin’s Theorem, which made 
precise J. E. Littlewood’s second principle: a measurable function is “nearly continuous.” 
We leave it as an exercise (see Problem 39) to prove the following general version of Lusin’s 
Theorem. 


Lusin’s Theorem Let X be a locally compact Hausdorff space, w: B( X) — [0, 00) a Radon 
measure, and f: X — Ra Borel measurable function that vanishes outside of a set of finite 
measure. Then for each € > 0, there is a Borel subset Xo of X and a function g € C,(X) 
for which 

f = gon Xo and p(X ~ Xo) <e. 


PROBLEMS 


27. Let (X, T) be a Hausdorff topological space. Show that T is a semiring if and only if T is the 
discrete topology. 


28. (Tyagi) Let (X, T) be a topological space and 4: T > [0, co] a premeasure. Assume that if 
O is open and (0) < oo, then x(bd 0) = 0. Show that every open set is *-measurable. 


4What is here called a Radon measure is often called a regular Borel measure or a quasi-regular Borel measure. 
What is here called a Radon premeasure is sometimes called a content or inner content or volume. 


29. 


30. 


31. 
32. 


33. 
34. 


35. 


36. 


37. 


38. 


39. 
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Show that the restriction of Lebesgue measure on the real line to the Borel o-algebra is a 
Radon measure. 


Show that the restriction of Lebesgue measure on the Euclidean space R” to the Borel 
o-algebra is a Radon measure. 


Show that a Dirac delta measure on a topological space is a Radon measure. 


Let X be an uncountable set with the discrete topology and {xx}1<,<o a countable subset of 
X. For E C X, define 
mW E)= SY 27, 
{n | X,€E} 


Show that 2* = B(X) and w: B(X) > isa Radon measure. 
Show that the sum of two Radon measures also is Radon. 


Let yx and v be Borel measures on B(X), where X is a compact topological space, and suppose 
that w is absolutely continuous with respect to v. If vy is Radon show that wp also is Radon. 


Let (X, T ) bea locally compact Hausdorff space and w: T — [0, co] a premeasure for which 
the restriction to B( X ) of u* is a Radon measure. Show that w is a Radon premeasure. 


Let X be a locally compact Hausdorff space and py: B( X) > [0, oo] a Radon measure. Show 
that any Borel set E of finite measure is inner regular in the sense that 


p(E) =sup {u(K)| K CE, K compact}. 
Conclude that if w is o-finite, then every Borel set is inner regular. 
Let X be a topological space, uw: B(X) — [0, co] a o-finite Radon measure, and E C X a 
Borel set. Show that there is a Gs subset A of X and an F,, subset B of X for which 

ACEC Band u(B~E) =p(E~A) =0. 
For a metric space X, show that B(X) is the smallest o-algebra with respect to which all of 
the continuous real-valued functions on X are measurable. . 


Prove Lusin’s Theorem as follows: 
(i) First prove it for simple functions by using the inner regularity of open sets and the 
locally compact extension property. 


(ii) Use part (i) together with Egoroff’s Theorem and the Simple Approximation Theorem 
to complete the proof. 


21.4 THE REPRESENTATION OF POSITIVE LINEAR FUNCTIONALS ON C_(X): 


THE RIESZ-MARKOV THEOREM 


Let X be a topological space. A real-valued functional % on C(X) is said to be monotone 
provided #(g) > W(h) if g > hon X, and said to be positive provided ( f) > 0 if f > 0on 
X. If pis linear, W(g —h) = (gg) — W(h) and, of course, if f = g—h, then f > 0 on X if and 
only if g > h on X. Therefore, for a linear functional, positivity is the same as monotonicity. 


Proposition 11 Let X be a locally compact Hausdorff space and 1, "2 be Radon measures 
on B(X) for which 


| fd =| Fano forall f C.(X). 
X X 


Then py = p22. 
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Proof By the outer regularity of every Borel set, these measures are equal if and only if 
they agree on open sets and therefore, by the inner regularity of every open set, if and only 
if they agree on compact sets. Let K be a compact subset of X. We will show that 


1(K) = po(K). 


Let e« > 0. By the outer regularity of both 4; and jy and the excision and monotonicity 
properties of measure, there is a neighborhood O of K for which 


M1(O~ K) <€/2 and po(O~K) <e/2. (9) 


Since X is locally compact and Hausdorff, it has the locally compact extension property. 
Hence there is a function f € C,(X) for which 0 < f <1on X, f =0on X~O, and f =1 
on K. For i = 1, 2, 


[ sauc= [fans | faut | faui= | fduj t+ wi(K). 
xX O O~K K xXx~O 


By assumption, 
| fdpy =| f dpz. 
x Xx 


mi(k)—a(K)=[ fdyo— [fin 


Therefore 


But 0 < f <1 0n X and we have the measure estimates (9). Hence, by the monotonicity of 
integration, 


ui (K) — u2(K)| cf fais ff 


Therefore 1(K) = 2(K). The proof is complete. LI 


f dp2 <e. 
K 


~ 


The Riesz-Markov Theorem Let X be a locally compact Hausdorff space and I a positive 
linear functional on C,(X). Then there is a unique Radon measure 1 on B(X), the Borel 
o-algebra associated with the topology on X, for which 


ufy= | tai for all f € C.(X). (10) 


Proof? Define (@) = 0. For each nonempty open subset © of X, define 


w(O) =sup {1(f)| feCe(X), O< f <1,supp f CO}. 
To prove the theorem we need to determine the measure of a set by knowing the values of the “integrals” of 


certain functions. It is an instructive exercise to show that if yz. is Lebesgue measure of R and J = (a, b), an open, 
bounded interval, then 


puy=bma= spf Fae f €C-(R), 0<f<tampp sch 
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Our strategy is to first show that yu is a Radon premeasure. Hence, by Theorem 10, if we 
denote by the restriction to the Borel sets of the outer measure induced by y, then i is a 
Radon measure that extends y. We then show that integration with respect to 1 represents 
the functional J. The uniqueness assertion is a consequence of the preceding proposition. 


Since J is positive, y takes values in [0, co]. We begin by showing that pis a premeasure. 
To establish countable monotonicity, let {O;,}?°, be a collection of open subsets of X that 
covers the open set O. Let f be a function in C,( X) with 0 < f <1 and supp f CO. Define 
K = supp f. By the compactness of K there is a finite collection {O;};_, that also covers K. 
According to Proposition 5, there is a partition of unity subordinate to this finite cover, that 
is, there are functions 91, ..., @, in C,(X) such that 


n 
> 9 =10n K and, for1 <k <n,0 < gy <1onX and supp gy, C Oy. 
i=l 
Then, since f =O0on X~K, 
f= > o%-fonX and, forl<k<n, 0< f-g <1and supp(% - f) CO. 
k=1 
By the linearity of the functional J and the definition of p, 
n n n OO 
KU f)= Yo: fl) => Mee: f) < > wr) < > w(Ox)- 
k=1 k=1 k=1 k=1 
Take the supremum over all such f to conclude that 


OO 
w(O) < > w(Ox). 
k=1 
Therefore pu is countably monotone. 


Since yz is countably monotone and, by definition, 4(%) = 0, yu is finitely monotone. 
Therefore to show that yp is finitely additive it suffices, using an induction argument, to let 
O = O; UO, be the disjoint union of two open sets and show that 


u(O) > w(O1) + w(O2). (11) 
Let the functions f,, fo belong to C,(X) and have the property that for 1 < k < 2, 
0< f <1and supp fi, C Or. 


Then the function f = f, + fo has support contained in O, and, since O, and ©) are disjoint, 
0 < f < 1. Again using the linearity of J and the definition of 4, we have 


I f)+( fo) =f) < w(O). 


If we first take the supremum over all such f; and then over all such f2 we have 


w(O1) + w(O2) < pO). 
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Hence we have established (11) and thus the finite additivity of 4. Therefore p is a 
premeasure. 


We next establish the inner regularity property of a Radon premeasure. Let O be open. 
Suppose j4(O) < oo. We leave the case 4(O) = oo as an exercise (see Problem 43). Let «> 0. 
We have to establish the existence of an open set U/ that has compact closure contained in O 
and y(U/) > w(O) — «. Indeed, by the definition of y, there is a function f. € C.(X) that has 
support contained in O and for which I( f.) > u(O) — «. Let K = supp f. But X is locally 
compact and Hausdorff and therefore has the locally compact separation property. Choose 
U to be a neighborhood of K that has compact closure contained in O. Then 


p(U) > I( fe) > w(O) -e. 
It remains only to show that if O is an open set of compact closure, then 4(O) < oo. But X 
is locally compact and Hausdorff and therefore has the locally compact extension property. 
Choose a function in C,(X) that takes the constant value 1 on O. Thus, since J is positive, 
p(O) < I( f) < ov. This concludes the proof that uw is a Radon premeasure. 


Theorem 10 tells us that the Carathéodory measure induced by p restricts to a Radon 
measure jz on B(X) that extends yw. We claim that (10) holds for jz. The first observation 
is that a continuous function is measurable with respect to any Borel measure and that a 
continuous function of compact support is integrable with respect to such a measure since 
compact sets have finite measure and continuous functions on compact sets are bounded. By 
the linearity of J and of integration with respect to a given measure and the representation 
of each f € C,(X) as the difference of nonnegative functions in C,(X), to establish (10) it 
suffices to verify that 


I(f) = | fafitorat f © C.(x) for which 0 < f<i. (12) 


Let f belong to C.(X). Fix a natural number n. For 1 < k < n, define the function 
gy: X — [0, 1] as follows: 


1 if f(x) > + 
er(x)=4 nf(x)—(k-1) if P< f(x) sf 
0 if f(x) < 


The function ; is continuous. We claim that 
1 n 
f=->@onX. (13) 
Meal 


To verify this claim, let x belong to X. If f(x) = 0, then g(x) = 0 for 1 < k <n, and 
therefore (13) holds. Otherwise, choose kp such that 1 < kp < n and om! <f(x)< “ Then 


1 f1<k<kj-1 
or(x) =< nf(x)—(ko-1) fk =k 
. 4 0 - ifkg <k <n. 


Thus (13) holds. 
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Since X is locally compact and Hausdorff, it has the locally compact separation property. 
Therefore, since supp f 1s compact, we may choose an open set O of compact closure for 
which supp f C O. Define Op = O, O,4; = @ and, for 1 < k <n, define 


k-—1 
= {re0| r(x)>—} 
n 
By construction, 


supp ¢ C Ox C Oy_1 and gy = 1 on Oj 41. 


Therefore, by the monotonicity of J and of integration with respect to i, and the definition 
of p, 


M(Oxs1) < Toe) < w(Op_-1) = w(Ox) + [m(Ox_1) — w(Ox)] 
and 
(Ort) < [ or dit < u(Op-1) = w(Ox) + [u(Ox-1) — u(r). 


However, 


p(O) = (Oo) = w(O1) >... = w(O,-1) =... > w(On) = 0. 


Therefore, since the compactness of O implies the finiteness of u(O), we have 


-u(0)-n(0) < 3 10) - | eah) < (0) +0(0), 
k=1 x 


Divide this inequality by n, use the linearity of J and of integration, together with (13) 
to obtain 


<“4(0). 


uf)- | faa 
x 
This holds for all natural numbers n and 4(O) < oo. Hence (10) holds. LO 


PROBLEMS 


40. Let X be a locally compact Hausdorff space, and Co(X) the space of all uniform limits of 
functions in C,(X). 


(i) Show that a continuous real-valued function f on X belongs to Co(X) if and only if for 
each a > 0 the set {x € X||f(x)| = a} is compact. 


(11) Let X* be the one-point compactification of X. Show that Co( X) consists precisely of the 
restrictions to X of those functions in C( X* ) that vanish at the point at infinity. 


41. Let X be an uncountable set with the discrete topology. 
(i) Whatis C,(X)? 
(ii) What are the Borel subsets of X? 
(iii) Let X* be the one-point compactification of X. What is C( X*)? 
(iv) What are the Borel subsets of X*? 


(v) Show that there is a Borel measure ys on X* such that u(X*) = 1 and f, f du =0 for 
each f inC,(X). 
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42. Let X and Y be two locally compact Hausdorff spaces. 
(i) Show that each f € C,(X XY) is the limit of sums of the form 


S 91(2)i(9) 


where g; € C,(X) and W; = C,(Y). (The Stone-Weierstrass Theorem is useful.) 
(ii) Show that B(X XY) C B(X) XB(Y 


). 
(iti) Show that B(X < Y) = B(X) X B(Y) if and only if X or Y is the union of a countable 
collection of compact subsets. 


43. In the proof of the Riesz-Markov Theorem, establish inner regularity in the case in which 
p(O) = oo. 


44. Let k(x, y) be a bounded Borel measurable function on X x Y, and let « and v be Radon 
measures on X and Y. 


(i) Show that 


Jf, eee Wy)d(mxXv) = [ J, (x )k(x, y)dp. 


_ [ a] i k(x, y)W(y)dv 


(y)dv 


du 


for ally € C,(X) andw €C,(Y). 
(ii) Ifthe integral in (i) is zero for all p and # in C,(X) and C,(Y), show that then k = 0a.e. 
[uw Xv]. 
45. Let X be a compact Hausdorff space and yu a Borel measure on B( X). Show that there is a 
constant c > 0 such that 
ff ta 
x 


21.5 THE RIESZ REPRESENTATION THEOREM FOR THE DUAL OF C(X) 


Let X be acompact Hausdorff space and C(X) = C,(X) the space of real-valued continuous 
functions on X. In the preceding section, we described the positive linear functionals on C(X). 
We now consider C(X) as a normed linear space with the maximum norm and characterize 
the continuous linear functionals on C(X ). First observe that each positive linear functional 
is continuous, that is, is bounded. Indeed, if L is a positive linear functional on C(X) and 
f €C(X) with || f|| < 1, then —-1 < f < 1 on X and hence, by the homogeneity and positivity 
of L, -L(1) < L(f) < L(1), that is, |L( f)| < L(1). Therefore L is bounded and the norm 
of the functional L equals the value of L at the constant function with value 1, that is, 


< cll fllmax for all f ¢ C(X). 


LI = L(A). 


Jordan’s Theorem tells us that a function of bounded variation may be expressed as the 
difference of increasing functions. Therefore, for X = [a, b], Lebesgue-Stieltjes integration 


Section 21.5 The Riesz Representation Theorem for the Dual of C(X) 463 


against a function of bounded variation may be expressed as the difference of positive 
linear functionals. According to the Jordan Decomposition Theorem, a signed measure 
may be expressed as the difference of two measures. Therefore integration with respect 
to a signed measure may be expressed as the difference of positive linear functionals. The 
following proposition is a variation, for general continuous linear functionals on C(X), of 
these decomposition properties. 


Proposition 12 Let X be a compact Hausdorff space and C(X ) the linear space of continuous 
real-valued functions on X, normed by the maximum norm. Then for each continuous linear 
functional L on C(X), there are two positive linear functionals L, and L_ on C(X) for which 


L=L4 — L_ and ||L|| = L4(1) + L_(1). 
Proof For f €¢ C(X) such that f > 0, define 


Li(f)= up EH): 


Since the functional L is bounded, L4(f) is a real number. We first show that for 
f >0,¢g>O0andc>0, 


Li(cf) =cLi(f) andLi(f+g) =L+(f)+L+(g). 


Indeed, by the positive homogeneity of L, Li(cef) =cL4(f) force > 0. Let f and g be two 
nonnegative functions in C(X). If0< go < fand0 <y < g,then0<9+y < f+gandso 


L(g) + L(t) =L(o +) < Lil(f +e). 


Taking suprema, first over all such » and then over all such &, we obtain 


Li(f)+Li(g) = Li(f +e). 
On the other hand, 1f0 < W < f+g,then0 < min{y, f} < fandthus0 < p—min{y, /f} < g, 
and therefore . . 
L(y) =L(min{y, f}) + L(y —[min{y, f}]) 
<Li(f)+Li(g). 
Taking the supremum over all such , we get 
Li(f+e9) <Lil(f) + L+(g). 


Therefore 
Li(ft+g)=Li(f)+L4(g). 


Let f be an arbitrary function in C(X), and let M and N be two nonnegative constants 
for which f + M and f + N are nonnegative. Then 


Li(f+M+N)=Li(f+M)+Li(N) =Li(f+N)+L4(M). 
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Hence 

Li(f+M)-Ls(M)=Li(f+N)—-L4(N). 
Thus the value of Li( f+ M)— L(M) is independent of the choice of M, and we define 
L+(f) to be this value. 

Clearly, L+: C(X) — R is positive and we claim that it is linear. Indeed, it is clear 
that Li(f +g) = Li(f)+Li(g). We also have L4(cf) = cL,(f) for c > 0. On the 
other hand, L,(—f) + Li(f) = L+(0) = 0, so that we have Li(—f) = —L+(f). Thus 
Li(cf) =cL4(f) for all c. Therefore L.. is linear. 

Define L_ = L, — L. Then L_ 1s a linear functional on C(X ) and it is positive since, 
by the definition of L4, L( f) < Li(f) for f > 0. We have expressed L as the difference, 
L. — L_, of two positive linear functionals on C(X). 

We always have ||L|| < ||Z+|| + ||L-|| = £4(1) + L_(1). To establish the inequality in 
the opposite direction, let ¢ be any function in C(X) for which 0 < o < 1. Then ||2~—1|| < 1 
and hence 

Z|] => L(2@ — 1) =2L(@) — L(1). 


Taking the supremum over all such , we have 
Ll] > 2L4(1) — L(1) = L4(1) + L_(1). 


Hence ||L|| = L.(1) + L_(1). LJ 


For a compact topological space X, we call a signed measure on B(X ) a signed Radon 
measure provided it is the difference of Radon measures. We denote by Radon(X) the 
normed linear space of signed Radon measures on X with the norm of v € Radon(X) given 
by its total variation ||v||yar, which, we recall, may be expressed as 


lvllvar = vt (X) +7(X), 


where vy = v* — v7 is the Jordan decomposition of v. We leave it as an exercise to show that 
|| - ||var 1s a norm on the linear space of signed Radon measures. 


The Riesz Representation Theorem for the Dual of C(X) Let X be a compact Hausdorff 
space and C(X) the linear space of continuous real-valued functions on X, normed by the 
maximum norm. Define the operator T : Radon( X) — [C(X)]* by setting, forv € Radon(X), 


(f)= | fav foralt fin C(x), 


Then T is a linear isometric isomorphism of Radon( X ) onto [C(X)]*. 


Proof Let L be a bounded linear functional on C( X ). By the preceding proposition, we may 
choose positive linear functionals L; and Lz on C(X) for which L = L; — L2. According to 
the Riesz-Markov Theorem, there are Radon measures on X, p, and pz, for which 


i(f)= | fan and La(f) = ff dua for all f € C(X). 
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Define pz = pu — #2. Thus p is a signed Radon measure for which L = T,. Hence T is 
onto. We infer from this and Proposition 11 that the representation of L as the difference of 
positive linear functionals is unique. Therefore, again by the preceding proposition, 


[ILI] = Li(1) + Lo(1) = aa(X) + mo(X) = lal(X) 
Therefore T is an isomorphism. O 


Corollary 13. Let X be a compact Hausdorff space and K* a bounded subset of Radon(X ) 
that is weak-x closed. Then K* is weak-x compact. If, furthermore, K* is convex, then K* is 
the weak-x closed convex hull of its extreme points. 


Proof Alaoglu’s Theorem tells us that each closed ball in [C(X)]* is weak-x compact. A 
closed subset of a compact topological space is compact. Thus K* is weak-x compact. We 
infer from the Krein-Milman Theorem, applied to the locally convex topological space 
comprising [C(X)]* with its weak-« topology, that if K* is convex, then K* is the weak-x 
closed convex hull of its extreme points. O 


The original Riesz Representation Theorem was proven in 1909 by Frigyes Riesz for 
the dual of C(X), where X = [a, b], a closed, bounded interval of real numbers. The general 
case for X a compact Hausdorff space was proven by Shizuo Kakutani in 1941. There were 
two intermediate theorems: in 1913 Johann Radon proved the theorem for X a cube in 
Euclidean space and in 1937 Stefan Banach proved it for X a compact metric space.° In each 
of these two theorems the representing measure is a finite measure on the Borel sets and 
is unique among such measures: there is no mention of regularity. The following theorem 
explains why this is so. 


Theorem 14 Let X be a compact metric space and ya finite measure on the Borel o-algebra 
B(X). Then p is a Radon measure. 


Proof Define the functional J: C(X) > R by 
(n= | f dw for all f € C(X). 
X 


Then J is a positive linear functional on C,(X) = C(X). The Riesz-Markov Theorem tells 
us that there is Radon measure pg: B( X) — [0, oo) for which 


[ fans | fan forall f € C(x) (14) 
xX xX 


We will show that w = po. First, consider an open set O. For each natural number n, let 
K, = {x € X| disty ~o(x) > 1/n}. Then {K,,} is an ascending sequence of compact subsets 
of O whose union is O. Since X is compact it is locally compact and therefore possesses the 


6 Albrecht Pietsch’s History of Banach Spaces and Functional Analysis [Pie07] contains an informative discussion 
of the antecedents of the general Riesz Representation Theorem. Further interesting historical information is 
contained in the chapter notes of Nelson Dunford and Jacob Schwartz’s Linear Operators, Part I [DS71]. 
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locally compact extension property. Select a sequence {f,} of functions in C(X) for which 
each f, =10n K, and f, =0on X~0O. Substitute f, for f in (14). Then 


O~ Ky 


~ K, 


n 


~ 


We infer from the continuity of the measures jz and yo and the uniform boundedness of the 
fn’s that for every open set O, 


p(O) = tim #( Kn) = im, Ho( Kn) = Bo(Q). 
Now let F be aclosed set. For each natural number n, define 


On = (_J B(x, 1/n). 


xeF 


Then O,, being the union of open balls, is open. On the other hand, since F is compact, 


F=()On. 


n=1 


By the continuity of the measures ws and yo and their equality on open sets, 
w(F) = lim w(On) = lim jo(On) = Hol F). 


We conclude that for every closed set F, u( F) = wo( F). 


Now let E be a Borel set. We leave it as an exercise (see Problem 51) to show that 
the Radon measure pp on the compact metric space X has the following approximation 
property: for each € > 0, there is an open set O, and a closed set F. for which 


F.C ECO, and wo(O, ~ Fe) <e. (15) 
Therefore, by the excision property of measure, 
p(Oc~ Fe) = (Oc) — w( Fe) <. 


From these two estimates we infer that |uo(£) — u( E)| <2-«. Thus the two measures agree 
on the Borel sets and therefore are equal. LJ 


Corollary 15 Let X be a compact metric space and {un: B(X) — [0, 00)} a sequence of 
Borel measures for which the sequence {u,,( X )} is bounded. Then there is a subsequence {tn,} 
and a Borel measure yt for which 


tim | fdiin =f fdu forall f € C(x), 


k—> oo 
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Proof Borsuk’s Theorem tells us that C( X ) is separable. The Riesz Representation Theorem 
and the preceding regularity theorem tell us that all the bounded linear functionals on C(X) 
are given by integration against finite signed Borel measures. The weak-* sequential 
compactness conclusion now follows from Helley’s Theorem. CJ 


In 1909 Frigyes Riesz proved the representation theorem which bears his name for the 
dual of C[a, b], in the following form: For each bounded linear functional L on C[a, b], there 
is a function g: [a, b] — R of bounded variation for which 


_ [ f(x) dg(x) for all f € C[a, b]: 


the integral is in the sense of Riemann-Stieltjes. According to Jordan’s Theorem, any function 
of bounded variation is the difference of increasing functions. Therefore it is interesting, 
given an increasing function g on [a, b], to identify, with respect to the properties of g, the 
unique Borel measure ys for which 


b 
| f(x) dg(x) -| f dw for all f € C[a, Dd]. (16) 
a [a, b| 


For a closed, bounded interval [a, b], let S be the semiring of subsets of [a, b] 
comprising the singleton set {a} together with subintervals of the form (c, d]. Then B[a, 5] 
is the smallest o-algebra containing S. We infer from the uniqueness assertion in the 
Carathéodory-Hahn Theorem that a Borel measure on B[a, b] is uniquely determined 
by its values on S. Therefore the following proposition characterizes the Borel measure 
that represents Lebesgue-Stieltjes integration against a given increasing function. For an 
increasing real-valued function on the closed, bounded interval [a, b] we define, fora<c <b, 


fle") = int f(x) and f(c~) = sup f(x). 


Define f(a*) and f(b~ ) in the obvious manner, and set f(a) = f(a), f(b*) = f() 
The function f is said to be continuous on the right at x € [a, b) provided f(x) = f(xt 


Proposition 16 Let g be an increasing function on the closed, bounded interval [a, b] and p 
the unique Borel measure for which (16) holds. Then {a} = g(a*) — g(a) and 


u(c, d] = g(d*) — g(c*) forall (c, d] C (a, DB]. (17) 

Proof We first verify that 
pic, b] = g(b) — g(c_ ) for allc € (a, DJ. (18) 
Fix a natural number n. The increasing function g is continuous except at a countable number 


of points in [a, b]. Choose a point c, € (a, c) at which g is continuous and c —c, <1/n. Now 
choose a point c}, € (a, c,) at which g is continuous and g(c,) — g(c),) < 1/n. Construct a 


468 Chapter 21 Measure and Topology 


continuous function f, on [a, b] for which 0 < f, <10n[a, b], f, =10n[cp, b] and f, =0 
on [a, c,,]. By the additivity over intervals property of the Riemann-Stieltjes integral, 


b Cn 
[ futs)aelx) = J fexrdg(x) + [e(0) - alc) 
a Ch 
By the additivity of integration with respect to ys over finite disjoint unions of Borel sets, 
fudu= | fadu+ len, b 
[a, B] (chs Cn) 


Substitute f = f, in (16) to conclude that 


f fn(x) dg(x) + [g(b) — g(cn)| =| . fn dp + plcn, 5). (19) 
However, since 0 < f, < 1 on [a, 6], 

[ f(x) dg(x)| < g(cn) —g(c,) <1/n 
and 

J. frdp| < p(cl, cn) < pci, c). 


Take the limit as n — oo in (19) and use the continuity of measure to conclude that (18) 
holds. A similar argument shows that u{a} = g(a*) — g(a) and 


u{c} = g(ct) — g(c~) for all c € (a, b). (20) 
Finally, we infer from (18), (20), and the finite additivity of » that fora<c<d<b, 
p(c, d] = plc, b] — wld, b] — wfc} + w{d} = g(d*) — g(c*). 


The proof is complete. LJ 


We have the following, slightly amended, version of Riesz’s original representation 
theorem from 1909. 


Theorem 17 (Riesz) Let [a, b] be a closed, bounded interval and F the collection of real- 
valued functions on [a, b] that are of bounded variation on [a, b], continuous on the right 
on (a, b), and vanish at a. Then for each bounded linear functional s on C[a, b], there is a 
unique function g belonging to F for which 


b 
WN) = | f(2)dglx) forall f € Cla, (21) 
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Proof To establish existence, it suffices, by the Riesz-Markov Theorem, to do so for w 
a positive bounded linear functional on C[a, b]. For such a y, the Riesz Representation 
Theorem tells us there is a Borel measure pu for which 


b 
w(f) =| Ff dy for all f E C[a, Dd}. 


Consider the increasing real-valued function g defined on [a, b] by g(a) = 0 and g(x) = 
p(a, x] + w{a} for x € (a, b]. The functon g inherits continuity on the right at each point in 
(a, b) from the continuity of the measure y. Thus g belongs to ¥. We infer from Proposition 
16 that 


b b 
[ sai= J $(x)dg(x) forall f € Cle, 6) 
where is the unique Borel measure on B[a, b] for which 
p(c, b] = g(b) — g(ct) = g(b) — g(c) for all c € (a, b) and p{a} = g(a™) — g(a). 


However, the measure yu has these properties. This completes the proof of existence. 


To establish uniqueness, by Jordan’s Theorem regarding the expression of a function 
of bounded variation as the difference of increasing functions, it suffices to let g;, g2 €¢ F be 
increasing functions which have the property that 


b b 
w(f) =| f(x) dg1(x) =| f(x) dgo(x) for all f € C[a, b], 
and show that g] = go. Take f =1 in this integral equality to conclude that 


1(b) = g1(b) — 91(@) = g2(b) — g2(a) = g0(d). 


Let & be represented by integration against the Borel measure w. We infer from Proposition 
16 and the right continuity of g; and go at each point in (a, b) that if x belongs to (a, b), 
then 


gi(b) — g1(x) = w(x, b] = go(b) — g2(x), 
and hence g1(x) = go(x). Therefore g; = g2 on [a, DI. a 


PROBLEMS 


46. Let xo be a point in the compact Hausdorff space X. Define L( f) = f(x) foreach f € C(X). 
Show that L is a bounded linear functional on C(X). Find the signed Radon measure that 
represents L. , 


47. Let X be a compact Hausdorff space and . a Borel measure on B( X). Show that there is a 
Radon measure po for which 


[ sau = | fauo forall fin c(x), 
Xx Xx 


470 Chapter 21 Measure and Topology 


48. 


49. 


50. 


51. 


52. 


53. 


Let g; and g» be two increasing functions on the closed, bounded interval [a, b] that agree at 
the end-points. Show that 


b b 
| f(x) dgi(x) =| f(x) dg2(x) for all f € C[a, b] 


if and only if g}(x*) = g(x) foralla<x<b. 


Let X be a compact Hausdorff space. Show that the Jordan Decomposition Theorem for 
signed Borel measures on B( X ) follows from the Riesz Representation Theorem for the dual 
of C(X) and Proposition 12. 


What are the extreme points of the unit ball of the linear space of signed Radon measures 
Radon(X), where X is a compact Hausdorff space? 


Verify (15) for E a Borel subset of a compact metric space X and uw a Radon measure on 
B(X). 
Let X be a compact metric space. On the linear space of functions F defined in the statement 


of Theorem 17, define the norm of a function to be its total variation. Show that with this 
norm ¥ is a Banach space. 


(Alternate proof of the Stone-Weierstrass Theorem (de Branges)) Let A be an algebra of 

real-valued continuous functions on a compact space X that separates points and contains 

the constants. Let A* be the set of signed Radon measures on X such that |u|(X) < 1 and 

Sy fdu =O for all f € A. 

(i) Use the Hahn-Banach Theorem and the Riesz Representation Theorem to show that if 
A* contains only the zero measure, then A = C(X). 


(i) Use the Krein-Milman Theorem and the weak-« compactness of the unit ball in 
Radon(X) to show that if the zero measure is the only extreme point of A+, then A+ 
contains only the zero measure. 


(iit) Let w be an extreme point of A+. Let f belong to A, with 0 < f < 1. Define measures 
py and pz by 


14(E) = [ fdwand p2(E) = i (1— f) dp for E € B(X). 


Show that yj; and j12 belong to A* and, moreover, ||44|| + ||u2|| = lll], and w1 + 2 = pw. 
Since pw is an extreme point, conclude that 41 = cy for some constant c. 


(iv) Show that f = c on the support of py. 


(v) Since A separates points, show that the support of 4 can contain at most one point. Since 
fy 1du = 0, conclude that the support of u is empty and hence p is the zero measure. 


21.6 REGULARITY PROPERTIES OF BAIRE MEASURES 


Definition Let X be a topological space. The Baire o-algebra, which is denoted by Ba(X), 
is defined to be the smallest o-algebra of subsets of X for which the functions in C,(X) are 
measurable. 


Evidently 


Ba(X) C B(X). 
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There are compact Hausdorff spaces for which this inclusion is strict (see Problem 58). The 
forthcoming Theorem 20 tells us that these two o-algebras are equal if X is a compact metric 
space. A measure on Ba( X) is called a Baire measure provided it is finite on compact sets. 
Given a Borel measure pw on the Borel o-algebra B( X), we define yo to be the restriction 
of w to the Baire o-algebra Ba(X). Then pp is a Baire measure. Moreover, each function 
f € C,(X) is integrable over X with respect to ip since it is measurable with respect to 
Ba( X ), bounded, and vanishes outside a set of finite measure. Since Ba( X) C B(X), 


[ tau = | fayo for at f € C(X). (22) 
X X 


We will establish regularity properties for Baire measures from which we obtain finer 
uniqueness properties for Baire representations than are possible for Borel representations 
in the Riesz-Markov and Riesz Representation Theorems. 

Let X be a topological space, S a o-algebra of subsets of X, and w: S > [0, co] a 
measure. A set E € Sis said to be outer regular provided 


w(E) = inf {u(O)| Oopen, Oc S, ECO} 
and said to be inner regular provided 
w(E) =sup {u(K)| K compact, Ke S, KCE}. 


A set that is both inner and outer regular is called regular with respect to 4. The measure 
p: S — [0, oo] is called regular provided each set in S is regular. 

We showed that Lebesgue measure on the Euclidean space R” is regular. We defined 
a Borel measure to be a Radon provided each Borel set is outer regular and each open set is 
inner regular. 


Proposition 18 Let X be a locally compact Hausdorff space and jr, and jz be two regular 
Baire measures on Ba( X ). Suppose 


[ fam = | f dpy for all f €C.(X). 
x x 
Then py = p22. 


Proof The proof is exactly the same as the corresponding uniqueness result for integration 
with respect to Radon measures. UO 


Proposition 19 Let X be a compact Hausdorff space, S a a-algebra of subsets of X, and 
pu: S — [0, 00) a finite measure. Then the collection of sets in S that are regular with respect 
to p is a o-algebra. 


Proof Define F to be the collection of sets in S that are regular with respect to w. Since X is 
compact and Hausdorff, a subset of X is open if and only if its complement in X is compact. 
Thus, since p is finite, by the excision property of measure, a set belongs to ¥ if and only if 
its complement in X belongs to F. We leave it as an exercise to show that the union of two 
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regular sets is regular. Therefore the regular sets are closed with respect to the formation 
of finite unions, finite intersections, and relative complements. It remains to show that F is 
closed with respect to the formation of countable unions. Let E = Ure, En, where each E,, 
is a regular set. By replacing each E, by E, ~ Ur E;, we may suppose that the E,,’s are 
disjoint. Let ¢ > 0. For each n, by the outer regularity of E,,, we may choose a neighborhood 
O,, of E,, which belongs to S and u(O,) < w(E,) + €/2”. Define O = U%, On. Then O is a 
neighborhood of E, O belongs to S, and, since 


oO 
O~EC\|J[On~ Kn], 
n=1 
by the excision and countable monotonicity properties of the measure jp, 


MO) - w(E) = w(O~E) < S w(On~ En) <e 


n=1 


Thus E£ is outer regular. A similar argument established inner regularity of E. This completes 
the proof. a 


Theorem 20 Let X be a compact Hausdorff space in which every closed set is a Gs Set. 
Then the Borel o-algebra equals the Baire o-algebra and every Borel measure is regular. In 
particular, if X is a compact metric space, then the Borel o-algebra equals the Baire o-algebra 
and every Borel measure is regular. 


Proof To show that the Baire o-algebra equals the Borel o-algebra, it is necessary and 
sufficient to show that every closed set is a Baire set. Let K be a closed subset of X. 
Then K is compact and, by assumption, is a Gs set. According to Proposition 4, there is a 
function f € C.(X) for which K = {x € X| f(x) = 1}. Since f belongs to C,(X), the set 
{x € X| f(x) = 1} is a Baire set. 


Let be a Borel measure on B(X). The preceding proposition tells us that the 
collection of regular Borel sets is a o-algebra. Therefore, to establish the regularity of B( X) 
it is necessary and sufficient to show that every closed set is regular with respect to the Borel 
o-algebra. Let K be a closed subset of X. Then K is compact since X is compact and thus K 
is inner regular. Since K is a Gg set and u( X) < 00, we infer from the continuity of measure 
that K is outer regular with respect to the Borel o-algebra. 


To conclude the proof, assume X is a compact metric space. Let K be a closed subset of 
X. We will show that K is a Gs set. Let n be a natural number. Define O, = U;- KB(x, 1/n). 
Then O is a neighborhood of the compact set K. According to the locally compact extension 
property, the function that takes the value 1 on K may be extended to a function f, € C.(X) 
that has support contained in O,. Define U/, = f'(—1/n, 1/n). Then U,, is an open Baire 
set. By the compactness of K, K = U°°_,U,. We infer from the continuity of measure that K 
is outer regular. LJ 


In the preceding section we used the Riesz-Markov Theorem to show that if X is a 
compact metric space, then every Borel measure on B(X) is a Radon measure. 
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Proposition 21 Let X be locally compact Hausdorff space. The Baire o-algebra Ba( X ) is the 
smallest o-algebra that contains all the compact G35 subsets of X. 


Proof Define F to be the smallest o-algebra that contains all the compact Gs sets. Let K 
be a compact Gs set. According to Proposition 4 there is a function f € C,(X) for which 
K = {x € X| f(x) = 1}. Therefore K belongs to F. Thus F C Ba(X). To establish the 
inclusion in the opposite direction we let f belong to C,( X ) and show it is measurable with 
respect to the Baire o-algebra. For a closed, bounded interval [a, b] that does not contain 
0, f~'[a, b] is compact and equal to N°, (a —1/n, b+1/n) since f is continuous and 
has compact support. Since F is closed with respect to the formation of countable unions, 
f—'(J) also belongs to F if J is any interval that does not contain 0. Finally, since 


f-'{0} = X~[f7!(—00, 0) U f7!(0, 00)] 


we infer that the inverse image under f of any nonempty interval belongs to F and therefore 
f is measurable with respect to the Baire o-algebra. U 


Proposition 22 Let X be a compact Hausdorff space. Then every Baire measure on Ba( X) 
is regular. 


Proof Let y be a Baire measure on Ba(X). Proposition 19 tells us that the collection of 
subsets of Ba( X ) that are regular with respect to u is a c-algebra. We infer from Proposition 
21 that to prove the proposition it is sufficient to show that each compact Gs subset K of X 
is regular. Let K be such a set. Clearly K is inner regular. Since u( X ) < oo and K is a Gs set, 
by the continuity of measure, K is outer regular. a 


We have the following small improvement regarding uniqueness of the Riesz Repre- 
sentation Theorem. 


Theorem 23 Let X be a compact Hausdorff space and I: C(X) —> R a bounded linear 
functional. Then there is a unique signed Baire measure yp. for which 


(A) = | fay forall f eC(X), 


Proof The Riesz Representation Theorem tells us that J is given by integration against a 
signed Radon measure p’ on the Borel subsets of X. Let yw be the restriction of v’ to the Baire 
o-algebra. Then, arguing as we did in establishing (22), integration against yz. represents J. 
The uniqueness assertion follows from Proposition 18 and the preceding regularity result. L] 


Definition A topological space X is said to be o-compact provided it is the countable union 
of compact subsets. 


Each Euclidean space R” is o-compact. The discrete topology on an uncountable space 
is not o-compact. Our final goal of this chapter is to prove regularity for Baire measures 
on a locally compact, o-compact Hausdorff space. To that end we need the following three 
lemmas whose proof we leave as exercises. 
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Lemma 24 Let X be a locally compact Hausdorff space and F C X a closed Baire set. Then 
forACF, 
A € Ba(X) if and only if A € Ba( F). 


Lemma 25 Let X be a locally compact Hausdorff space and E C X a Baire set that has 
compact closure. Then E is regular with respect to any Baire measure p on Ba(X). 


Lemma 26 Let X be a locally compact, c-compact Hausdorff space and A C X a Baire set. 
Then A =\|Jy_, Ax where each Aj is a Baire set that has compact closure. 


Theorem 27 Let X be a locally compact, c-compact Hausdorff space. Then every Baire 
measure on Ba(X) is regular. 


Proof Since X is locally compact, Lemma 25 tells us that any Baire set of compact closure 
is regular. Moreover, by the preceding lemma, since X is o-compact, every Baire set is the 
union of a countable collection of Baire sets each of which has compact closure. Therefore 
to complete the proof it is sufficient to show that the countable union of Baire sets, each of 
which has compact closure, is regular. 


Let E = U??, Ex, where each E; is a Baire set of compact closure. Since the Baire 
sets are an algebra, we may suppose that the E;’s are disjoint. Let « > 0. For each k, by the 
regularity of E;, we may choose Baire sets K;, and ©;, with K, compact and ©; open, for 
which 

K, C Ex C Ox 
and 
w( Ex) —€/2* <pw( Kx) < w(Or) < w( Ex) + €/2*. 


If u( E) = oo then, of course, E is outer regular. Moreover, since 
OO OO n 
uy \_) Ex = p(E) = ooand pw _) Kx = lim ph _) Kx 
k=l k=1 re O \k=1 
E contains compact Baire sets of the form Un, Kx, which have arbitrarily large measure 
and therefore E is inner regular. 
Now suppose that ( E) < oo. Then O = UP’, O; is again an open Baire set and since 
(o.@) 
O~EC|)[O. ~ Ex], 
k=1 
by the countable monotonicity and excision properties of measure, 
(o.@) 
w(O) — w(E) = p(O~E) < >) w(Oe~ Ex) <e. 
k=1 


Thus E is outer regular. To establish inner regularity observe that 


N N 
p(E) = gin, 2H Ex) < slim, 2, HK) +e 
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Thus £ contains compact Baire subsets of the form U_, Kx, which have measure arbitrarily 
close to the measure of E. Therefore E is inner regular. a 


We have the following small improvement regarding uniqueness of the Riesz-Markov 
Theorem for o-compact spaces. 


Theorem 28 Let X be a locally compact, o-compact Hausdorff space and I: C.(X) > Ra 
positive linear functional. Then there is a unique Baire measure p for which 


I(f) = | fu forall f & C.(X). 


The reader should be warned that standard terminology regarding sets and measures 
that are either Baire or Borel has not been established. Neither has the terminology regarding 
Radon measures. Some authors take the class of Baire sets to be the smallest o-algebra for 
which all continuous real-valued functions on X are measurable. Others do not assume every 
Borel or Baire measure is finite on every compact set. Others restrict the class of Borel sets to 
be the smallest o-algebra that contains the compact sets. Authors (such as Halmos [Hal50]) 
who do measure theory on o-rings rather than o-algebras often take the Baire sets to be 
the smallest o-ring containing the compact G»’s and the Borel sets to be the smallest o-ring 
containing the compact sets. In reading works dealing with Baire and Borel sets or measures 
and Radon measures, it is imperative to check carefully the author’s definitions. A given 
statement may be true for one usage and false for another. 


PROBLEMS 
54, Let X be a separable compact Hausdorff space. Show that every closed set is a Gs set. 


55. Let X be a Hausdorff space and w: B(X) > [0, oo] a o-finite Borel measure. Show that p is 
Radon if and only if it is regular. 


56. Show that a Hausdorff space X is both locally compact and o-compact if and only if there is 
an ascending countable collection {O;,}?°, of open subsets of X that covers X and for each k, 


Oy; is a compact subet of Ox4.1. 


57. Let xo be a point in the locally compact Hausdorff space X. Is the Dirac delta measure 
concentrated at xo, 6,,, a regular Baire measure? 


58. Let X be an uncountable set with the discrete topology and X* its Alexandroff compactification 
with x* the point at infinity. Show that the singleton set {x*} is a Borel set that is not a Baire 
set. 


59. Let X bea locally compact Hausdorff space. Show that a Borel measure w.: B(X) — [0, oo] is 
Radon if and only if every Borel set is measurable with respect to the Carathéodory measure 
induced by the premeasure yu: B(X) — [0, oo]. 


60. Prove Lemmas 24, 25, and 26. 
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61. Let X be a compact Hausdorff space and f;,..., f, continuous real-valued functions on X. 
Let v be a signed Radon measure on X with |»|(X) <1 and let ¢; = J, fidv, for1 <i<n. 


(i) Show that there is a signed Radon measure pw on X with |u|(X) < 1 for which 
[ fidu=e 
xX 


[ edu | gaatorall ge C(x) 
X X 


for any signed Radon measure A with |A|(X) < landsuch that f, fidA =c; forl <i <n. 


(ii) Suppose that there is a Radon measure v on X with v(X) = land f, fidv =cj,1 <i <n. 
Show that there is a Radon measure yz on X with p(X) = Land fy fidu = cj, for1 < 
i <n, which minimizes [{ x 8d among all Radon measures that satisfy these conditions. 
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A topological group is a group G together with a Hausdorff topology on G for which the 
group operation and inversion are continuous. We prove a seminal theorem of John von 
Neumann which tells us that on any compact topological group G there is a unique measure 
pon the Borel c- algebra B(G), called Haar measure, that is invariant under the left action 
of the group, that is, 

L(g: E) =p(E) for all g € G, E € B(G). 


Uniqueness follows from Fubini’s Theorem; existence is a consequence of a fixed point 
theorem of Shizuo Kakutani which asserts that for a compact group G, there is a functional 
w €[C(G)]* for which 


olf =1] = 1 and y[x+> f(x)] = pfx f(g -x)] for all g € G, f € C(G). 


Alaoglu’s Theorem is crucial in the proof of this fixed point theorem. Details of the proof 
of the existence of Haar measure are framed in the context of a group homomorphism of G 
into the general linear group of [C(G)]*. We also consider mappings f of a compact metric 
space X into itself and finite measures on B( X). Based on Helley’s Theorem, we prove the 
Bogoliubov-Krilov Theorem which tells us that if f is a continuous mapping on a compact 
metric space X, then there is a measure yz on B(X) for which 


mx) =1and f oo fdu= [edu torall y € Lx j). 
x x 


Based on the Krein-Milman Theorem, we prove that the above w may be chosen so that f is 
ergodic with respect to y, that is, if A belongs to B( X) and w([A~f(A)] U[f(A)~A4]) = 0, 
then u(A) = 0 or w(A) = 1. 


22.1 TOPOLOGICAL GROUPS: THE GENERAL LINEAR GROUP 


Consider a group G together with a Hausdorff topology on G. For two members £1, 22 Of G, 
denote the group operation by g - 29, denote the inverse of a member g of the group by 
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1 and let e be the identity of the group. We say that G is a topological group provided 
the mapping (81, 82) 8; - g2 is continuous from G XG to G, where G XG has the product 
topology, and the mapping g+> g™! is continuous from G to G. By a.compact group we 
mean a topological group that is compact as a topological space. For subsets G, and G2 of G, 
we define G; - G2 = {81 - g2lai € Gi, g2 € Go} and Gy" = {g"'|g € Gj}. If G, has just one 
member g, we denote {g} - G2 by g - Go. 

Let E be a Banach space and £(E) the Banach space of continuous linear operators 
on E.' The composition of two operators in £(E) also belongs to £(E) and clearly, for 
operators T, Sé€ L(E), 

ISoTl] < USI - ITT. (1) 


Define GL(E) to be the collection of invertible operators in £(E). An operator in £(E) 
is invertible if and only if it is one-to-one and onto; the inverse is continuous by the Open 
Mapping Theorem. Observe that for T, S € GL(E), (So T)~! = T~! 0 S“!. Therefore, 
under the operation of composition, GL( E) is a group called the general linear group of E. 
We denote its identity element by Id. It also is a topological space with the topology induced 
by the operator norm. 


Lemmal Let E be a Banach space and the operator C € L( E) have ||C|| <1. Then Id —C is 
invertible and 
(Id—C)“"}] < (1- ICI). (2) 


Proof We infer from (1) that for each natural number k, ||C*|| < ||C||*. Hence, since 
Cll <1, the series of real numbers }° , \|C*|| converges. The normed linear space L( E) 


is complete. Therefore the series* of operators 5, C* converges in £(E) to a continuous 
linear operator. But observe that 


(Id—C) (Sct) = (Yee (Id —C) = Id—c"*" for all n. 
k=0 


Therefore the series 3, C* converges to the inverse of Id —C. The estimate (2) follows 
from this series representation of the inverse of Id —C. a 


Theorem 2 Let E be a Banach space. Then the general linear group of E, GL(E), is a 
topological group with respect to the group operation of composition and the topology induced 
by the operator norm on L(E). 


Proof For operators T, T’, S, S’ in GL( E), observe that 
ToS—T’oS’ =To(S—S’)+(T-T')oS’. 
Therefore, by the triangle inequality for the operator norm and inequality (1), 


IToS-ToS' <TH. WS-S +7 -T'- US‘. 


Recall that the operator norm, ||7'||, of T € C(E) is defined by ||7 || = sup {|T(x) || |x € E, |x| < 1}. 


(o@) . 
2The series S C* is called the Neumann series for the inverse of J — C. 
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The continuity of composition follows from this inequality. 
If S belongs to GL( E) and ||S — Id || <1, then from the identity 
S~! —Id = (Id—S)S~! = (Id—S)[Id-(1 - S)}“}, 


together with the inequalites (1) and (2), we infer that 


: |S —Id 
S~! — Td || < ————_. 3 
I< 75-44) (3) 


Therefore inversion is continuous at the identity. Now let T and S belong to GL( E) and 
(S — TI] <|7—1|-!. Then 


\T~'S — Id = |" (S—T) | < ITT WS - TH <1. 


Thus, if we substitute T—'S for S in (3) we have 


_ 7's —Id|| 
S17 ~Id|| < $$ 
I<7C 7-15 —Id| 


From this inequality and the identities 
S-'—7 =($"'7 -Id)T™ and T~!5 — Id = T"1(S —-T) 


we infer that 


T |? -|IT-S 
st rty ¢ ITE T= Sh 
1—||T~*]- 7 — SI 
The continuity of inversion at T follows from this inequality. a 


In the case E is the Euclidean space R”, GL( E) is denoted by GL(n, R). If a choice 
of basis is made for R”, then the topology on GL(n, R) is the topology imposed by the 
requirement that each of the n Xn entries of the matrix representing the operator with 
respect to this basis is a continuous function. 

A subgroup of a topological group with the subspace topology is also a topological 
group. For example, if H is a Hilbert space, then the subset of GL( H) consisting of those 
operators that leave invariant the inner product is a topological group that is called the 
orthogonal linear group of H and denoted by O( H). 


PROBLEMS 
In the following exercises, G is a topological group with unit element e and E is a Banach 
space. 
1. If T, is a base for the topology at e, show that {g-O|O € T¢} is a base for the topology at 
geG. 
2. Show that K, - Kz is compact if K,; and K2 are compact subsets of G. 


3. Let O be a neighborhood of e. Show that there is also a neighborhood U of e for which 
U=U'andu-uco. 
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4. Show that the closure H of a subgroup H is a subgroup of G. 


5. Let G, and G2 be topological groups and h: G, —> G2 a group homomorphism. Show that h is 
continuous if and only if it is continuous at the identity element of G). 


6. Use the Contraction Mapping Principle to prove Lemma 1. 


7. Use the completeness of £( E) to show that if C € £( E) and ||C|| <1, then $°, C* converges 
in £(E). 

8. Show that the set of n Xn invertible real matrices with determinant 1 is a topological group 
if the group operation is matrix multiplication and the topology is entrywise continuity. This 
topological group is called the special linear group and denoted by SL(n, R). 


9. Let H be a Hilbert space. Show that an operator in GL( H) preserves the norm if and only if 
it preserves the inner product. 


10. Consider R” with the Euclidean inner product and norm. Characterize those n Xn matrices 
that represent orthogonal operators with respect to an orthonormal basis. 


11. Show that GL( £) is open in £(E). 


12. Show that the set of operators in GL(E) comprising operators that are linear compact 
perturbations of the identity is a subgroup of GL( E£). It is denoted by GL¢(E). 


22.2 KAKUTANI’S FIXED POINT THEOREM 


For two groups G and H, a mapping ¢: G > H is called a group homomorphism provided 
for each pair of elements g1, g2 inG, 9(g1 - 82) = o(g1) - o( 82). 


Definition Let G be a topological group and E a Banach space. A group homomorphism 
a: G—> GL(E) is called a representation of G on E. 


As usual, for a Banach space E, its dual space, the Banach space of bounded linear 
functionals on E, is denoted by E*. We recall that the weak-+ topology on E* is the topology 
with the fewest number of sets among the topologies on E* such that, for each x € E, the 
functional on E* defined by y+» (x) is continuous. Alaoglu’s Theorem tells us that the 
closed unit ball of E* is compact with respect to the weak-x topology. 


Definition Let G be a topological group, E a Banach space, and 7: G > GL(E) a represen- 
tation of G on E. The adjoint representation 7*: G — GL( E*) is a representation of G on E* 
defined for g € G by 

w*(g) = omg”) for ally € E*. (4) 


We leave it as an exercise to verify that 7* is a group homomorphism. 

Recall that a gauge or Minkowski functional on a vector space V is a positively 
homogeneous, subadditive functional p: V > R. Such functionals determine a base at the 
origin for the topology of a locally convex topological vector space V. In the presence of a 
representation 7 of a compact group G on a Banach space E, the following lemma establishes 
the existence of a family, parametrized by G, of positively homogeneous, subadditive 
functionals on E*, each of which is invariant under 7* and, when restricted to bounded 
subsets of E*, is continuous with respect to the weak-x topology. 


3Observe that no continuity assumption is made regarding a representation. It is convenient to view it as a purely 
algebraic object and impose continuity assumptions as they are required in a particular context. 
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Lemma 3_ Let G be a compact group, E a Banach space, and 7: G > GL(E) arepresentation 
of G on E. Let xq belong to E and assume that the mapping g +> m(g)xo is continuous from G 
to E, where E has the norm topology. Define p: E* + R by 


P(t) = sup lys(a(g)x0)| for p € E*.. 
RE 


Then p is a positively homogeneous, subadditive functional on E*. It is invariant under 7r*, 
that is, 


P(m"(g)W) = p(y) for all p < E* and g € G. 


Furthermore, the restriction of p to any bounded subset of E* is continuous with respect to the 
weak-x topology on E*. 


Proof Since G is compact and, for p € G, the functional g-> #(m(g)xo) is continuous on 
G, p: E* — Ris properly defined. It is clear that p is positively homogeneous, subadditive, 
and invariant with respect to 7*. Let B* be a bounded subset of E*. To establish the weak-« 
continuity at p: B* > R, it suffices to show that for each wo € B* and € > 0, there is a weak-« 
neighborhood N (io) of Wo for which 


(arg )x0) — bo(m(g)x0)| < € for all p € N (fo) 1 B* and g « G. (5) 


Let i belong to B* and « > 0. Choose M > 0 such that ||y|| < M for all f €¢ B*. The mapping 
g+>m(g)xo is continuous and G is compact. Therefore there are a finite number of points 
{g1,---,8n} in G and for each k,1 < k <n, a neighborhood O, of g, such that {Og} 
covers G and, for 1 <k <n, 


Il7(g)xo — 1( gx )xoll < €/4M for all g € O,,. (6) 
Define the weak-+ neighborhood N (Jo) of Wo by 


N (ho) = {Ww € E* | |(b — bo)(( gx )x0)| < €/2 for 1 <k <n}. 
Observe that for any g € G, wy €¢ E* and1 <k <n, 


W(a(g)x0) — bol m(g)x0) = (b — Yo )[ar( ge x0] + (4 — bo) [t(g)x0 — 7(gx)xo]- (7) 


To verify (5), let g belong to G and w belong to N(w%o) N B*. Choose k, 1 < k <n, for which 
g belongs to Ox. Then |(¢ — Yo )[m( gx )x0]| < €/2 since w belongs to N(wo). On the other 
hand, since || — Wl] < 2M, we infer from (6) that |(y — wo) [a(g)xo — (gx) x0]| < €/2. 
Therefore, by (7), (5) holds for NV (wo). C 


Definition Let G be a topological group, E a Banach space, and 7:G > GL(E) a 
representation of G on E. A subset K of E is said to be invariant under 7 provided 
m™(g)(K) CK forall g € G. A point x € E is said to be fixed under 7 provided m(g)x = x for 
allg éG. 


Theorem4 LetG be acompact group, Ea Banach space, and 7: G > GL(E) arepresentation 
of G on E. Assume that for each x € E, the mapping g+> 1(g)x is continuous from G to E, 
where E has the norm topology. Assume there is a nonempty, convex, weak-* compact subset 
K* of E* that is invariant under a*. Then there is a functional ys in K* that is fixed under n*. 
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Proof Let 7 be the collection of all nonempty, convex, weak-* closed subsets of K* that 
are invariant under 7*. The collection F is nonempty since K* belongs to F. Order F by 
set inclusion. This defines a partial ordering on ¥. Every totally ordered subcollection of F 
has the finite intersection property. But for any compact topological space, a collection of 
nonempty closed subsets that has the finite intersection property has nonempty intersection. 
The intersection of any collection of convex sets is convex and the intersection of any 
collection of 7* invariant sets is 7* invariant. Therefore every totally ordered subcollection 
of F has its nonempty intersection as a lower bound. We infer from Zorn’s Lemma that 
there is a set K) in F that is minimal with respect to containment, that is, no proper subset 
of Kj belongs to ¥. This minimal subset is weak-* closed and therefore weak-« compact. 
We relabel and assume K* itself is this minimal subset. 


We claim that K* consists of a single functional. Otherwise, choose two distinct 
functionals yy, and 2 in K*. Choose x € E such that (x9 ) #W(x0). Define the functional 
p: K* > Rby 

P(p) = sup ly(ar(g)x0)| for p € K*. 
8e 


Since K* is weak-* compact, the Uniform Boundedness Principle tells us that K* is bounded. 
According to the preceding lemma, p is continuous with respect to the weak-* topology. 
Therefore, if, for r > 0 and n € K*, we define 


Bo(n, r) = {we K* | p(w—n) <r} and Bo(n, r) = {we K*| p(p—n)<r}, (8) 


then Bo(7, 7) is open with respect to the weak-+ topology on K* and Bo( 7, r) is closed with 
respect to the same topology. Each of these sets is convex since, again by the preceding 
lemma, p is positively homogeneous and subadditive. 


Define d = sup {p(w — ¢)|W, 9 € K*}. Then d is finite since p is continuous on the 
weak-* compact set K*, and d > 0 since p(y — Ww) > 0. Since K* is weak-* compact and 
each Bo(7, r) is weak-* open, we may choose a finite subset {ih; }p—1 of K* for which 


K* = UW Bo( px, d/2). 
kal 


Define 
Wp t... t+... thy 


* 
v= n 
The functional * belongs to K* since K* is convex. Let & be any functional in K*. By the 
definition of d, p(y —p,) < d for 1 <k <n. Since {Bo(px, d/2)}?_, covers K*, belongs to 
some Bo(w;, d/2.) for some ko. Thus, by the positive homogeneity and subadditivity: of p, 


—1 d 
p(W-y*) sd! whered! = "——-d+> <d. 


Define 7 
K'= () Bo(¥, a’). 


peKk* 


Then K’ is a weak-* closed, and hence weak-* compact, convex subset of K*. It is nonempty 
since it contains the functiorial y*. We claim that K’ is invariant under 7*. To verify this, for 
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n € K', b € K* and g € G, we must show that p(a*(g)n — %) < d’. Since pis 7* invariant 
and p(n -— 7*(g7')b) <d’, 

p(m*(g)n—b) = p(n—a*(g"' )) <d’. 
By the minimality of K*, K* = K’. This is a contradiction since, by the definition of d, there 
are functionals / and #” in K* for which p(y’ — Ww”) > d’ and hence ” does not belong 


to Bo(’, d’). We infer from this contradiction that K* consists of a single functional. The 
proof is complete. a 


Definition Let G be a compact group and C(G) the Banach space of continuous real-valued 
functions on G, normed by the maximum norm. By the regular representation of G on C(G) 
we mean the representation 7: G > GL(C(G)) defined by 


[w(g) f(x) = f(g! -x) forall f € C(G),x€ Gand g eG. 


We leave it as an exercise to show that the regular representation is indeed a 
representation. The following lemma shows that the regular representation of a compact 
group G on C(G) possesses the continuity property imposed in Theorem 4. 


Lemma 5 Let G be a compact group and 7: G > GL(C(G)) the regular representation of G 
on C(G). Then for each f € C(G), the mapping g+> 7(g) f is continuous from G to C(G), 
where C(G) has the topology induced by the maximum norm. 


Proof Let f belong to C(G). It suffices to check that the mapping g+—> 7(g) f is continuous 
at the identity e ¢ G. Let « > 0. We claim that there is a neighborhood of the identity, U/, for 
which 


lf(g-x) — f(x)| <€for all g EU, xe G. (9) 
Let x belong to G. Choose a neighborhood of x, O,, for which 
| f(x’) — f(x)| < €/2 for all x’ € Oy. 

Thus 

| f(x’) — f(x")| < for all x’, x” € O,. (10) 
By the continuity of the group operation, we may choose a neighborhood of the identity, 
U,, and a neighborhood x, V,, for which V, C O, and UU, - Vx € O;. By the compactness 
of G, there is a finite collection {V,,}/_, that covers G. Define U = 1y_, Ux, Then U is a 
neighborhood of the identity in G. We claim that (9) holds for this choice of U/. Indeed, let g 
belong in U and x belong to G. Then x belongs to some Y,,. Hence 

xEVy, CO, andg-xEUXYV,, CUy, X Vx, € Ox,. 
Therefore both x and g- x belong to O,, so that, by (10), |f(g-x) — f(x)| < €. Thus (9) is 
established. Replace U by UN U-!. Therefore 
f(g! -x)— f(x)| <eforall geU,xeG, 

that is, 

Ir(g) f — 7(€) fllmax <€ for all g € U. 
This establishes the required continuity. UL 
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For G a compact group, we call a functional w € [C(G)]* a probability functional 
provided it takes the value 1 at the constant function f = 1 and is positive in the sense that 
for f e C(G), if f > OonG, then p( f) > 0. 


Theorem 6 (Kakutani) Let G be a compact group and 7:G > GL(C(G)) the regular 
representation of G on C(G). Then there is a probability functional w € [C(G)]* that is fixed 
under the adjoint action 1*, that is, | 


W(f) = o(a(g)f) forall f € C(G) and g €G. (11) 


Proof According to Alaoglu’s Theorem, the closed unit ball of [C(G)]* is weak-« compact. 
Let K* be the collection of positive probability functionals on C(G). Observe that if p is a 
probability functional and f belongs to C(G) with || fllmax < 1, then, by the positivity and 
linearity of w, since —1 < f <1, 


—-1=(-1) <¥(f) s (1) =1. 


Thus |( f)| < 1 and hence |||] < 1. Therefore K* is a convex subset of the closed unit ball 
of E*. We claim that K* is weak-x closed. Indeed, for each nonnegative function f € C(G), 
the set (y € [C(G)]* | w( f) = 0} is weak-x closed, as is the set of functionals yw that take the 
value 1 at the constant function f =1. The set K* is therefore the intersection of weak-* 
closed sets and so it is weak-x closed. As a closed subset of a compact set, K* is weak-x 
compact. Finally, the set K* is nonempty since if xo is any point in G, the Dirac functional 
that takes the value f(xq) at each f € C(G) belongs to K*. 


It is clear that K* is invariant under 7*. The preceding lemma tells us that the regular 
representation possesses the continuity required to apply Theorem 4. According to that 
theorem, there is a functional in y € K* that is fixed under 7*, that is, (11) holds. O 


PROBLEMS 
13. Show that the adjoint of a representation also is a representation. 
14. Show that a probability functional has norm 1. 


15. Let E be a reflexive Banach space and K* a convex subset of E* that is closed with respect to 
the metric induced by the norm. Show that K* is weak-« closed. On the other hand, show that 
if E is not reflexive, then the image of the closed unit ball of E under the natural embedding 
of E in ( E*)* = E** is a subset of E** that is convex, closed and bounded with respect to the 
metric induced by the norm but is not weak-« closed. 


16. Let G be a compact group, E a reflexive Banach space, and 7: G> GL( E) a representation. 
Suppose that for each x € E, the mapping g+> a(g)x is continuous. Assume there is a 
nonempty strongly closed, bounded, convex subset K of E that is invariant with respect to 7. 
Show that K contains a point that is fixed by 7. 


17. Let G be a topological group, E be a Banach space, and 7: G -> GL( E) a representation. For 
x € E, show that the mapping g+> 7(g)x is continuous if and only if it is continuous at e. 


18. Suppose G is a topological group, X a topological space, and : GX X > Xa mapping. For 
g €G, define the mapping 7(g): X + X by m(g)x = 9(g, x) for all x € X. What properties 
must ¢ possess in order for 7 to be a representation on G on C(X)? What further properties 
must ¢ possess in order that for each x € X, the mapping g+> 7(g)x is continuous? 
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22.3 INVARIANT BOREL MEASURES ON COMPACT GROUPS: 
~VON NEUMANN’S THEOREM 


A Borel measure on a compact topological space X is a finite measure on B(X), the smallest 
g-algebra that contains the topology on X. We now consider Borel measures on compact 
groups and their relation to the group operation. 


Lemma 7 Let G be a compact group and pa Borel measure on B(G). For g € G, define the 
set function p,: B(G) — [0, oo) by 


yg(A) = w(g- A) for all A € B(G). 


Then jg is a Borel measure. If 1 is Radon, so is jtg. Furthermore, if w is the regular 
representation of G on C(G),* then 


[ w(e)fdu= | fdye fora f € C(G), (12) 
G G 


Proof Let g belong to G. Observe that multiplication on the left by g defines a topological 
homeomorphism of G onto G. From this we infer that A is a Borel set if and only if g- A is 
a Borel set. Therefore the set function j1, is properly defined on B(G). Clearly, yw, inherits 
countable additivity from py and hence, since w.(G) = u(G) < 00, pg is a Borel measure. 
Now suppose yz is a Radon measure. To establish the inner regularity of w,, let O be open 
in G and € > 0. Since p is inner regular and g - O is open, there is a compact set K contained 
in g -O for which p(g-O~K) <e. Hence K’ = g~'! . K is compact, contained in O, and 
beg(O~K’) <. Thus pe is inner regular. A similar argument shows py is outer regular. 
Therefore jy is a Radon measure. 


We now verify (12). Integration is linear. Therefore, if (12) holds for characteristic 
functions of Borel sets it also holds for simple Borel functions. We infer from the Simple 
Approximation Theorem and the Bounded Convergence Theorem that (12) holds for all 
f € C(G) if it holds for simple Borel functions. It therefore suffices to verify (12) in the case 
f = xa, the characteristic function of the Borel set A. However, for such a function, 


[ (e)fdu=nle-A)= | fda 7 


Definition Let G be a compact group. A Borel measure wp: B(G) — [0, 00) is said to be 
left-invariant provided 


p(A) =p(g-A) forallg ¢ Gand A€ BG). — (13) 

It is said to be a probability measure provided y(G) = 1. 
A right-invariant measure is defined similarly. If we consider R” as a topological group 
under the operation of addition, we showed that the restriction of Lebesgue measure p,, on 
4 continuous function on a topological space is measurable with respect to the Borel o-algebra on the space 
and, if the space is compact and the measure is Borel, it is integrable with respect to this measure. Therefore, each 


side of the following formula is properly defined because, for each f € C(G) and g € G, both f and w(g)f are 
continuous functions on the compact topological space G and both yu and jz are Borel measures. 
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R" to B(R") is left-invariant with respect to addition, that is, ,(E +x) = Hn(E) for each 
Borel subset E of R” and each point x € R". Of course, this also holds for any Lebesgue 
measurable subset E of R”. 


Proposition 8 On each compact group G there is a Radon probability measure on B(G) that 
is left-invariant and also one that is right-invariant. 


Proof Theorem 6 tells us that there is a probability functional y € [C(G)]* that is fixed 
under the adjoint of the regular representation on G on C(G). This means that #(1) = 1 and 


W( f) =W(m(g~") f) for all f € C(G) and g €G. (14) 


On the other hand, according to the Riesz-Markov Theorem, there is a unique Radon 
measure yu on B(G) that represents w in the sense that 


Wf) = | fdutorall fe C(9). (15) 
G 
Therefore, by (14), 


W(f) =¥(m(g"') f) = f m(g~') f du for all f ¢ C(G) and g €G. (16) 


Hence, by Lemma 7, 
w(f) = | fa, for all f € C(G) and ge G. 
G 


By the same lemma, ,-1 1s a Radon measure. We infer from the uniqueness of the 
representation of the functional that 
= p,-1 forall ge G. 


Thus p is a left-invariant Radon measure. It is a probability measure because is a probability 
functional and thus 


1=w(1)= f du=w(9). 


A dual argument (see Problem 25) establishes the existence of a right-invariant Radon 
probability measure. O 


Definition Let G be a topological group. A Radon measure on B(G ) is said to be a Haar 
measure provided it is a left-invariant probability measure. 


Theorem 9 (von Neumann) Let G be a compact group. Then there is a unique Haar measure 
pon B(G). The measure p is also right-invariant. 


Proof According to the preceding proposition, there is a left-invariant Radon probability 
measure » on B(G) and a right-invariant Radon probability measure v on B(G). We claim 
that 


[ tau= [sav torat fe (G). (17) 
G G 
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Once this is verified, we infer from the uniqueness of representations of bounded linear 
functionals on C(G) by integration against Radon measures that » = v. Therefore every 
left-invariant Radon measure equals v. Hence there is only one left-invariant Radon measure 
and it is right-invariant. 

To verify (17), let f belong to C(G). Define h: GXG > R by A(x, y) = f(x- y) for 
(x, y) € XG. Then h is a continuous function on G X G. Moreover the product measure 
y X wis defined on a o-algebra of subsets of G x G containing B(G x G). Therefore, since h is 
measurable and bounded on a set G XG of finite v X » measure, it is integrable with respect 
to the product measure v X p over G X G. To verify (17) it suffices to show that 


[_ratexul= [san and [ rawxel= ff fa (18) 


However, by Fubini’s Theorem,° 


[,ratexea= |) fm -)dp(y) 


By the left-invariance of p and (12), 


ie Jau(y) = | Faw forallx <6, 
G G 


dv(x). 


Thus, since »(G) = 1, 


[ ramxel= J fan-o(9)= f rap 


A similar argument establishes the right-hand equality in (18) and thereby completes the 
proof. 


The methods studied here may be extended to show that there is a left-invariant Haar 
measure on any locally compact group G, although it may not be right-invariant. Here we 
investigated one way in which the topology on a topological group determines its measure 
theoretic properties. Of course, it is also interesting to investigate the influence of measure 
on topology. For further study of this interesting circle of ideas it is still valuable to read 
John von Neumann’s classic lecture notes Invariant Measures [vN91]. 


PROBLEMS 


19. Let » be a Borel probability measure on a compact group G. Show that yw is Haar measure if 
and only if 


| foggdp= | f dw for all g € G, f € C(G), 
G G 
where ¢,(g’) = g- a’ forall g’ eG. 
5See the last paragraph of Section 20.1 for an explanation of why, for this product of Borel measures and 


continuous function h, the conclusion of Fubini’s Theorem holds without the assumption that the measure yp is 
complete. 
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20. Let 4. be Haar measure on a compact group G. Show that w X yw is Haar measure on G XG. 


21. Let G be acompact group whose topology is given by a metric. Show that there is a G-invariant 
metric. (Hint: Use the preceding two problems and average the metric over the group G XG.) 
22. Let 4. be Haar measure on a compact group G. If G has infinitely many members, show that 
u({g}) = 0 for each g € G. If G is finite, explicitly describe p. 
23. Show that if 4 is Haar measure on a compact group, then (0) > 0 for every open subset 
O of G. 
24. Let S' = {z = e”| 6 € R} be the circle with the group operation of complex multiplication 
and the topology it inherits from the Euclidean plane. 
(i) Show that S! is a topological group. 
(ii) Define A = {(a, B)|a,B € R,O < B—a< 2z}. For A = (a, B) € A, define I, = 
{e® |a <<}. Show that every proper open subset of S! is the countable disjoint union 
of sets of the form J), A € A. 
(iii) For A = (a, B) € A, define w(Ja) = (8 — a)/27. Define (5S!) = 1. Use part (ii) to 
extend yu to set function defined on the topology T of S'. Then verify that, by Proposition 
9 from the preceding chapter, 4 may be extended to a Borel measure ys on B(S'). 


(iv) Show that the measure defined in part (ii) is Haar measure on S!. 


(v) The torus 7” is the topological group consisting of the Cartesian product of n copies of 
S' with the product topology and group structure. What is Haar measure on TJ”? 


25. Let u be a Borel measure on a topological group G. For a Borel set E, define p'(E) = u(E7'), 
where E~! = {g7! |g € E}. Show that y’ also is a Borel measure. Moreover, show that pz is 
left-invariant if and only if y’ is right-invariant. 


22.4 MEASURE PRESERVING TRANSFORMATIONS AND ERGODICITY: 
THE BOGOLIUBOV-KRILOV THEOREM 


For a measurable space (X, M), a mapping T: X > X is said to be a measurable 
transformation provided for each measurable set E, T~!(E) also is measurable. Observe 
that for a mapping T: X > X, 


T is measurable if and only if g o T is measurable whenever the function g is measurable. 
(19) 
For a measure space (X, M, w), a measurable transformation T: X > X is said to be 
measure preserving provided 


u(T~!(A)) = p(A) for all A eM. 


Proposition 10 Let (X, M, w) be a finite measure space and T: X > X a measurable 
transformation. Then T is measure preserving if and only if g oT is integrable over X whenever 
g is, and 


[ eotdu= | edu foraltg 1x, jb). (20) 
X X 


Proof First assume (20) holds. For A € M, since u(X) < oo, the function g = y,4 belongs 
to L'(X, w) and go T = yp-1(4). We infer from (20) that 4(7~!(A)) = w(A). 
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Conversely, assume 7 is measure preserving. Let g be integrable over X. If g* is the 
positive part of g, then (go 7)* = gt oT. Similarly for the negative part. We may therefore 
assume that g is nonnegative. For a simple function g = S7_, cx - XA,, since T is measure 
preserving, 


[ eorau= | » Ck XA, OF au = | Bacar 
xX X | k=1 X {k=l 


Therefore (20) holds for g simple. According to the Simple Approximation Theorem, there 
is an increasing sequence {g,} of simple functions on X that converge pointwise on X to g. 
Hence {g, o T} is an increasing of simple functions on X that converge pointwise on X to 
goT. Using the Monotone Convergence Theorem twice and the validity of (20) for simple 
functions, we have 


[ eotdu = lim | modu = lim  snau =| edu. 
X noo X n> OO xX 4 [] 


For a measure space (X, M, y.) and measurable transformation T: X > X, a measur- 
able set A is said to be invariant under T (with respect to ~) provided 


p(A~T~1(A)) = w(T7'(A)~A) = 0, 


dp = > cen Ae) = | gdp. 
k=l x 


that is, modulo sets of measure 0, T~!(A) = A. It is clear that 
A is invariant under T if and only if y4 o T = v4 ae. on X. (21) 


If (X, M, pw) also is a probability space, that is, ~( X) = 1, a measure preserving transfor- 
mation T is said to be ergodic provided any set A that is invariant under T with respect to 
has w(A) = Oor p(A) = 1. 


Proposition 11 Let (X, M, yw) be a probability space and T: X — X a measure preserving 
transformation. Then, among real-valued measurable functions g on X, 


T is ergodic if and only if whenever g 0 T = g a.e. on X, then g is constanta.e.on X. (22) 


Proof First assume that whenever go T = g ae. on X, the g is constant a.e. on X. Let 
A € M be invariant under 7. Then g = ya, the characteristic function of A, is measurable 
and ya oT = x, ae. on X. Thus yg is constant a.e., that is, w(A) = 0 or p(A) = 1. 


Conversely, assume T is ergodic. Let g be a real-valued measurable function on X for 
which go T = g ae. on X. Let k be an integer. Define X, = {x € X|k < g(x) << k +1}. 
Then X; is a measurable set that is invariant under T. By the ergodicity of 7, either 
p(X,) = 0 or w(X;,) = 1. The countable collection {X;}xez is disjoint and its union is X. 
Since z(X) = 1 and p is countably additive, 4(X;) = 0, except for exactly one integer k’. 
Define J, = [k’, k’ +1]. Then p{x € X | g(x) € 1} =1 and the length of 1, @(1,), is 1. 


Let n be a natural number for which the descending finite collection {J,}}_, of closed, 
bounded intervals have been defined for which 


€(1,) = 1/24" and w{xeX| e(x)eh}=1forl<k<n. 
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Let In = [an, bn], define cy = (bn — an )/2, 
={xEX| an <e(x) <cn} and By ={xEX| cn <e(x) <b}. 


Then A, and B, are disjoint measurable sets whose union is u{x € X| g(x) € In}, a set of 
measure 1. Since both A, and B, are invariant under 7, we infer from the ergodicity of T 
that exactly one of these sets has measure 1. If (A, ) = 1, define I,,,; = [an, cn]. Otherwise, 
define In41 = [cn, b,]. Then €(In41) = 1/2" and p{x € X| g(x) € In4i} = 1. We have 
inductively defined a descending countable collection {J,}°, of closed, bounded intervals 
such that 

€(In) = 1/2"! and p{x € X| g(x) € I,} =1 for alla. 


By the Nested Set Theorem for the real numbers, there is a number c that belongs to 
every I,. We claim that g = c a.e. on X. Indeed, observe that if g(x) belongs to J, then 
|e(x) —c| < 1/2"-! and therefore 


1=p{xeX| g(x) el,} < pfx eX | lg(x) —c|] < 2") <1. 
Since 
~ -1 
{xe X | s(x) =c}=() {re X| lg(x) —c| < 1/2" \, 
n=1 
we infer from the continuity of measure that 
— i —1t _ 
p{xeX| g(x) =c} = lim pfx € x | lg(x) —c| < 1/2" oe 


O 


Theorem 12 (Bogoliubov-Krilov) Let X be a compact metric space and the mapping f : X > 
X be continuous. Then there is a probability measure pw on the Borel o-algebra B(X ) with 
respect to which f is measure preserving. 


Proof Consider the Banach space C(X) of continuous real-valued functions on X with the 
maximum norm. Since X is a compact metric space, Borsuk’s Theorem tells us that C(X) is 
separable. Let 7 be any Borel probability measure on B(X). Define the sequence {,} of 
linear functionals on C(X) by 


n—-1 
wta)= [PS eos dn for alln € Nand g € C(X). (23) 
k= 


Observe that 
lbn(g)| < ||gllmax for alln e Nand ge C(X). 


Thus {y,} is a bounded sequence in [C(X )]*. Since the Banach space C(X) is separable, we 
infer from Helley’s Theorem that there is a subsequence {,,,} of {y,} that converges, with 
respect to the weak-* topology, to a bounded functional p € [C(X)]*, that is, 


lim n,(g) = #(g) for all ge C(X). 


k—>00 
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Therefore 
fim Yn, (8° f)=W(g0 f) forall ge C(X). 


However, for each k and g € C(X), 


Yn (8° Ff) — Yn (8) = + | [go fret! - ed] 
nk xX 


Take the limit as k — oo and conclude that 


W(go f) =(g) forall g « C(X). (24) 
Since each w, is a positive functional, the limit functional also is positive. The Riesz-Markov 


Theorem tells us that there is a Borel measure yu for which 


W(g)= J edu for all gE C(X). 


We infer from (24) that 


[ eofdu= | gdutorallg €C(x), 
xX xX 


According to Proposition 10, f is measure preserving with respect to w. Finally, for the 
constant function g = 1, #,(g) = 1 for all n. Therefore (2) = 1, that is, w is a probability 
measure. L 


Proposition 13 Let f: X — X be a continuous mapping on a compact metric space X. 
Define M ¢ to be the set of probability measures on B( X) with respect to which f is measure 
preserving. Then a measure yw in Mf is an extreme point of M ; if and only if f is ergodic with 
respect to w. 


Proof First suppose that yw is an extreme point of Mr. To prove that f is ergodic, we assume 
otherwise. Then there is a Borel subset A of X that is invariant under f with respect to uw 
and yet 0 < u(A) <1. Define 


v(E) = p(ENA)/p(A) and n( E) = p( EN [X~A])/u(X~A) for all E € B(X). 
Then, since w(X) = 1, 
p=A-v+(1—A)-n where A= p(A). 


Both v and 7 are Borel probability measures on B( X ). We claim that f is measure preserving 
with respect to each of these measures. Indeed, since f is measure preserving with respect 
to » and A is invariant under f with respect to 4, for each E € B(X), 


m(ENA)=p(f-'(ENA)) =H(f '(E)O fF '(A)) =a f (E) NA). 


Therefore f is invariant with respect to v. By a similar argument, it is also invariant with 
respect to n. Therefore v and 7 belong to M+ and hence yp is not an extreme point of M,. 
Therefore f is ergodic. 
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Now suppose f is ergodic with respect to u € M,. To show that y is an extreme point 
of M;, let A € (0, 1) and v, n € M, be such that 


p= Av+(1—A)n. (25) 


The measure v is absolutely continuous with respect to yw. Since 4(X) < oo, the Radon- 
Nikodym Theorem tells us that there is a function h € L'(X, w) for which 


v(A)= | h dy for all A € B(X). 
A 


It follows from the Simple Approximation Theorem and the Bounded Convergence Theorem 
that 


[ eav= [ ¢-hau forall g ¢L°(X,n). (26) 
X X 


Fix € > 0, and define X, = {x € X | h(x) > 1/A +}. We infer from (25) that 


p(X.) > a | hdu>(1+A-€)- (Xe). 


€ 


Hence p(X.) = 0. Therefore h and h o f are essentially bounded on X with respect to p. 
Hence, using (26), first with g = h o f and then with g = h, and the invariance of f with 
respect to v, we have 


[rophdu= | hofar= | nav= f Pay, 
x xX X x 


We infer from this equality and the invariance of f with respect to yu that 


[tho t— HP dy =| thopPau-2-[ hos hdns | dp 


=2-f du-2- f ho fhdy 
X X 


=2.f Wau—2- f Hdu=0. 
Xx X 


Therefore ho f =h ae. [u] on X. By the ergodicity of f and Proposition 11, there is a 
constant c for which h = c a.e. [u] on X. But pw and v are probability measures and hence 


1=9(x)= | hduw=c-p(X) =c. 

X 

Hence = v and thus w = 7. Therefore ys is an extreme point of M -. L 
Theorem 14 Let f: X — X be a continuous mapping on a compact metric space X. Then 


there is a probability measure . on the Borel o-algebra B(X) with respect to which f is 
ergodic. 
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Proof Let Radon(X) be the Banach space of signed Radon measures on B(X) and the 
linear operator ®: Radon(X) — [C(X)]* be defined by 


P(n)(g) = | gdp for all « € Radon(X) and g € C(X). 
X 


The Riesz Representation Theorem for the dual of C(X) tells us that ® is a linear 
isomorphism of Radon(X) onto [C(X)]*. Define M, to be the set of probability measures 
on B( X) with respect to which f is measure preserving. Then the measure p is an extreme 
point of M, if and only if ®(w) is an extreme point of ®(.M ;). Therefore, by the preceding 
proposition, to prove the theorem we must show that the set ®(.M;) possesses an extreme 
point. According to the Bogoliubov-Krilov Theorem, M, is nonempty. A consequence of 
the Krein-Milman Theorem, Corollary 13 of the preceding chapter, tells us that ®(.M,) 
possesses an extreme point provided it is bounded, convex, and closed with respect to the 
weak-« topology. The Riesz-Markov Theorem tells us that ® defines an isomorphism of 
Radon measures onto positive functionals. The positive functionals are certainly weak-« 
closed, as are the functionals that take the value 1 at the constant function 1. According to 
Proposition 11, a functional » € [C(X)]* is the image under ® of a measure that is invariant 
with respect to f if and only if 


W(go f)—w(g) =O for all ge C(X). 


Fix g € C(X). Evaluation at the function go f — g is a linear functional on [C(X)]* that is 
continuous with respect to the weak-« topology and therefore its kernel is weak-» closed. 
Hence the intersection 


3 
= 


1) {we[c(x)} | ¥(gof) =H(2)} 


geC(X) 


also is a weak-x closed set. This completes the proof of the weak-« closedness of ®(M ;) 
and also the proof of the theorem. LI 


Asymptotic averaging phenomena were originally introduced in the analysis of the 
dynamics of gases. One indication of the significance of ergodicity in the study of such 
phenomena is revealed in the statement of the following theorem. Observe that the right- 
hand side of (27) is independent of the point x € X. 


Theorem 15 Let T be a measure preserving transformation on the probability space 
(X, M, pw). Then T is ergodic if and only if for every g € L'(X, p), 


p(X) 


1 n=l 1 
>> )) = —-- | g dy for almost all x € X. (27) 
Fam x 


A proof of this theorem may be found in the books Introduction to Dynamical 
Systems [BS02] by Michael Brin and Garrett Stuck and Lectures on Ergodic Theory [Hal06] 
by Paul Halmos. These books also contain varied examples of measure preserving and 
ergodic transformations. 
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26. 


27. 


29. 


31. 


32. 


33. 


PROBLEMS 


Let X be a compact metric space. Use the Stone-Weierstrass Theorem to show that the 
Banach space C(X) of continuous functions on X, normed with the maximum norm, is 
separable. 


Does the proof of the the Bogoliubov-Krilov Theorem also provide a proof in the case X is 
compact Hausdorff but not necessarily metrizable? 


. Let (X, M, w) be a finite space and T: X > X a measurable transformation. For a 


measurable function g on X, define the measurable function Ur(g) by Ur(g)(x) = g(T(*)). 
Show that T is measure preserving if and only if for every 1 < p < 00, Ur maps L?(X, p) 
into itself and is an isometry. 


Suppose that T: R” > R’ is linear. Establish necessary and sufficient conditions for T to be 
measure preserving with respect to Lebesgue measure on R”. 


. Let S! = {z = e® | 6 © R} be the circle with the group operation of complex multiplication 


and yz be Haar measure on this group (see Problem 24). Define T: Si > S$! by T(z) = 2’. 
Show that T preserves p. 


Define f: R* — R? by f(x, y) = (2x, y/2). Show that f is measure preserving with respect 
to Lebesgue measure. 


(Poincaré Recurrence) Let T be a measure preserving transformation on a finite measure 
space (X, M, w) and the set A be measurable. Show that for almost all x € X, there are 
infinitely many natural numbers n for which T”(x) belongs to A. 


Let (X, M, w) bea probability space and T: X > X an ergodic transformation. Let the func- 
tion g € L'(X, w) have the property that go T = ga.e. on X. For a natural number n, show 
that there is a unique integer k(n) for which p{x € X|k(n)/n < g(x) < (k(n) +1)/n} =1. 
Then use this to show that ifc = fy gdy, then 


[te-aa 


From this conclude that g = c a.e. and thereby provide a different proof of one implication 
in Proposition 11. 


< 


1 1 
— -p(A) < — foralln e Nand Ae M. 
n n 
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Lindeléf property 
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Probability functional, 484 
Probability measure, 486, 489 
Product measure, 416 
Product topology, 224, 244 
Projection onto a subspace, 266 
Proper mapping, 203 
Proper subset, 3 
Pseudometric, 185 
Pseudonorm, 256 


Quadratic form, 321 


Radon measure, 455 
Radon premeasure, 456 
Radon-Nikodym derivative, 385 
Radon-Nikodym Theorem, 382 
Radon-Riesz Theorem, 168, 315 
Raleigh quotient, 326 
Range of a mapping, 5 
Rapidly Cauchy sequence, 146, 398 
Rational numbers, 11 
Reflexive space, 276 
Reflexivity 
of L?(E), ECR,1< p<1, 284 
of L?(X, w),1< p< ov, 409 
not of C[a, b], 284 
not of L? fa, b]; p = 1, 00, 285 
not of C(X), X infinite, 302 
Regular measure, 471 
Regular representation, 483 
Regular set, 471 
Regular topological space, 227 
Relative complement, 3, 35 
Representation of a group, 480 
Residual set, 214 
Restriction of a mapping, 5 
Riemann integral, 69 
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Riemann integrability 
characterization of, 104 
Riemann-Lebesgue Lemma, 96, 167 
Riemann-Stieltjes integral, 438 
Riesz Representation Theorem 
for the dual of L?(R), 1 < p < ov, 160 
for the dual of C[a, b], 468 
for the dual of C(X), 464 
for the dual of L?(X, w), 1 < p< oo, 400 
Riesz’s Lemma, 261 
Riesz’s Theorem, 251, 261 
Riesz-Fischer Theorem, 148, 398 
Riesz-Fréchet Representation Theorem, 313 
Riesz-Markov Theorem, 458 
Riesz-Schauder Theorem, 331 
Rigid motion, 434 
Ring of sets, 357 


Saturated measure, 342 
Scalar product, 424 
Schauder’s Theorem, 325 
Second category, set of the, 214 
Second countable space, 229 
Self-adjoint operator, 321 
Semialgebra of sets, 357 
Semifinite measure, 342 
Semiring, 354 
Separability 

in a metric space, 204 

in a normed linear space, 152 

in a topological space, 229 

not of L™[a, b], 152 

of C[a, b], 151 

of L?(E),ECR,1 < p<, 152 
Separation 

by a hyperplane, 290 

of points, 248 

of subsets in a topological space, 237 
Sequence 

Cauchy in measure, 365 

pointwise bounded, 207 

uniformly bounded, 207 

uniformly Cauchy, 206 
Sequential compactness 

in a metric space, 199 

in a topological space, 234 
Set 

Borel subset of R 

bounded, 9, 195 

Cantor, 49 


closed, 17 
dense, 12 
finite measure, 340 
meager, 214 
measurable subset of R, 35 
negative, 343 
nowhere dense, 212 
null, 343 
of the first category, 214 
of the second category, 214 
open, 16 
perfect, 53 
positive, 343 
residual, 214 
o-finite measure, 340 
Set function 
countably additive, 338 
countably monotone, 339, 346 
excision property of, 339 
finitely additive, 339 
finitely monotone, 346 
monotonicity, 339 
o-algebra, 38 
o-finite measure, 340 
Signed measure, 342 
Signed Radon measure, 464 
Simple Approximation Lemma, 363 
Simple Approximation Theorem, 62, 363 
Simple function, 61, 362 
Singleton set, 3 
Singular measures, 384 
Singular function, 127 
Sorgenfrey Line, 226 
Space 
Banach space, 255 
dual, 157 
Euclidean, 184 
Hilbert, 310 
locally convex topological vector, 286 
metric, 183 
normed linear, 184 
topological, 223 
Span, 254 
Square root of an operator, 323 
Stone-Weierstrass Approximation Theorem, 
248 
Strictly convex norm, 297 
Strictly increasing function, 50 
Strong topology on a normed space, 275 
Stronger topology, 231 


Subadditive functional, 277 
Subbase for a topology, 224 
Subcover, 18 

Subsequence, 21 

Support of a function, 448 
Supporting set, 295 


~ .Supremum; sup, 9 


~ Symmetric difference of sets, 41, 388 
Symmetric operator, 321 
Symmetry of an inner product, 309 


Tietze Extension Theorem, 241 
Tightness, 98, 377 
Tonelli’s Theorem, 420 
Topological group, 478 
Topological space, 223 
arcwise connected, 238 — 
compact, 234 
connected, 237 
discrete topology, 223 
first countable, 229 
Hausdorff, 227 
metrizable, 229 
normal, 227 
regular, 227 
second countable, 229 
sequentially compact, 234 
subspace topology, 223 
trivial topology, 223 
Topology 
product, 224 
strong topology on a normed space, 275 
weak topology on a normed space, 275 
Torus, 488 
Total variation, 404 
Total variation of a signed measure, 345 
Totally bounded metric space, 198 
Totally ordered set, 6 
Translate, 255 
Translation invariant, 428 
Triangle Inequality, 137, 184 
for real numbers, 9 
Trivial topology, 223 
Tychonoff Product Theorem, 245 
Tychonoff topological space, 227 
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Uniform absolute continuity, 392 
Uniform Boundedness Principle, 269 

in L?,171 
Uniform integrability, 93, 376 
Uniform metric, 206 
Uniformly bounded sequence, 207 
Uniformly Cauchy sequence, 206 
Uniformly continuous function, 26, 192 
Uniformly equicontinuous, 208 
Unit'function, 137 
Unit vector, 255 
Upper Darboux sum, 69 
Upper derivative, 111 
Upper Lebesgue integral, 73 
Upper Riemann integral, 69 . 
Urysohn Metrization Theorem, 229, 242 . 
Urysohn’s Lemma, 239 | 


Vitali Convergence Theorem, 94, 98, 376, 377, 
399 

Vitali covering, 109 

Vitali’s Covering Lemma, 109 

Vitali’s Theorem, 48 

Vitali-Hahn-Saks Theorem, 392 

von Neumann’s proof of the Radon-Nikodym 
Theorem, 387 


Weak topology 


on a topological space, 231 
Weak topology on a linear space, 275 
Weak topology on a normed space, 275 
Weak-+ convergent sequence of functionals, 
275 
Weak-« topology on a dual space, 275 
Weaker topology, 231 
Weakly convergent sequence, 275 
Weakly sequentially compact, 173 
Weierstrass Approximation Theorem, 247 


Young’s Inequality, 140 


Zorn’s Lemma, 6 


